(54) Polypeptides having L-asparaginase activity

(57) Disclosed are polypeptides which originate from mammals and have L-asparaginase activity. The
polypeptides are easily prepared by applying recombinant DNA techniques to DNAs encoding the
polypeptides, and they exert satisfactory effects in the treatment and/or the prevention of diseases
caused by tumor cells dependent on L-asparagine, and cause no substantial serious side effects even
when administered to humans at a relatively high dose.
Description

The present invention relates to L-asparagine amidohydrolytic enzymes, more particularly, to polypeptides which
originate from mammals and have L-asparaginase activity.

L-Asparaginase (EC 3.5.1.1) is an enzyme which catalyzes the hydrolysis of L-asparagine into L-aspartic
acid and ammonia. Studies on the antitumor activity of L-asparaginase started from the following reports: J. G.
Kidd et al. described the inhibitory action of guinea pig sera on lymphoma cells in "The Journal of Experimental
Medicine", Vol.98, pp.565-582 (1953), and J. D. Broome et al. demonstrated in "Nature", Vol.191, pp.1,114-1,115 (1961),
that the L-asparaginase activity of the guinea pig sera was responsible for the inhibitory action. It is now understood
that the inhibitory action is caused by the depletion of L-asparagine, a nutrient essential for the proliferation and
survival of some tumor cells which lack L-asparagine synthetase activity, such as those of acute lymphocytic leukemia,
but not for normal cells. The hydrolysis of L-asparagine by L-asparaginase in patients with such tumor cells induces
selective death of the tumor cells, resulting in the treatment of malignant tumors.

L-Asparaginase has been studied energetically for its actual use as an antitumor agent, and one derived from
Escherichia coli is now in use as a therapeutic agent for leukemia and lymphoma. However, L-asparaginase from
Escherichia coli is merely a foreign protein to humans, and repeated administration of conventional compositions
containing such L-asparaginase may cause serious side effects such as anaphylactic shock, urticaria, edema, wheezing and
dyspnea. These compositions are inevitably restricted with respect to administration dose and frequency. Therefore,
some proposals to reduce or even eliminate such side effects have been made.

As a first proposal, Japanese Patent Kokai No.119,082/79 discloses a chemically modified L-asparaginase from
Escherichia coli, in which at least 65 % of the amino acids are blocked with 2-O-substituted polyethylene
glycol-4,6-dichloro-S-triazine. As a second proposal, human L-asparaginases are disclosed in Japanese Patent Kokai
Nos.320,684/92 and 19,018/80, where the L-asparaginases are respectively obtained from cultures of human cell lines
and human urine. While the first proposal has the advantage that the L-asparaginase from Escherichia coli is easily
obtainable on an industrial scale, it has the disadvantage that the modifying reaction is difficult to control and the
side effects cannot be eliminated completely. While the second proposal has the advantage that, unlike L-asparaginase
from Escherichia coli, the L-asparaginases from humans may not substantially induce antibodies even when administered
to patients, it has the disadvantage that it is not easy to obtain the L-asparaginases in a desired amount by the
processes disclosed in Japanese Patent Kokai Nos.320,684/92 and 19,018/80.

Recently, recombinant DNA technology has advanced remarkably. Once a DNA which encodes a desired polypeptide
is isolated, it is relatively easy to obtain a transformant which produces the polypeptide by constructing a
recombinant DNA comprising the DNA and a self-replicable vector, followed by introducing the recombinant DNA into a
host such as a microorganism or an animal or plant cell. The polypeptide is obtainable in a desired amount from the
culture of the transformant. However, no DNA which encodes a mammalian L-asparaginase has been isolated, and no
mammalian L-asparaginase has been produced by recombinant DNA techniques.

Therefore, there has been a great demand to isolate DNAs which encode active L-asparaginases originating from
mammals and to establish processes to prepare the L-asparaginases on a large scale by applying recombinant DNA
techniques to the isolated DNAs.

In view of the foregoing, the first object of the present invention is to provide a polypeptide which originates
from a mammal and has L-asparaginase activity.

The second object of the present invention is to provide a DNA which encodes the polypeptide. 
The third object of the present invention is to provide a recombinant DNA which contains a DNA which encodes
the polypeptide and a self-replicable vector.

The fourth object of the present invention is to provide a transformant obtainable by introducing a DNA which
encodes the polypeptide into a host.

The fifth object of the present invention is to provide a process to prepare the polypeptide by using the transformant. 
The sixth object of the present invention is to provide an agent for susceptive diseases, containing the polypeptide 
as an effective ingredient. 

The first object of the present invention is attained by polypeptides which originate from mammals and have
L-asparaginase activity.

The second object of the present invention is attained by DNAs which encode the polypeptides. 
The third object of the present invention is attained by recombinant DNAs containing DNAs which encode the
polypeptides and a self-replicable vector.

The fourth object of the present invention is attained by transformants obtainable by introducing the DNAs into
appropriate hosts.

The fifth object of the present invention is attained by a process to prepare the polypeptides which comprises 
culturing the transformants and collecting the produced polypeptides from the resultant cultures. 

The sixth object of the present invention is attained by agents for susceptive diseases, containing the polypeptides
as effective ingredients.

FIG. 1 is a scheme of the overlap extension method.

FIG. 2 is a restriction map of the present recombinant DNA pKGPA/WT. 

FIG. 3 is a scheme of the preparation of the present recombinant DNA pBlgGPA/WT. 
FIG. 4 is a restriction map of the present recombinant DNA pBlgGPA/WT.

FIG. 5 is a restriction map of the present recombinant DNA pKGPA/D364stp. 

FIG. 6 is a restriction map of the present recombinant DNA pKHA/MUT1 . 

FIG. 7 is a restriction map of the present recombinant DNA pKHA/MUT2. 

FIG. 8 is a restriction map of the present recombinant DNA pKHA/MUT3.

FIG. 9 is a restriction map of the present recombinant DNA pKHA/MUT4.

FIG. 10 is a restriction map of the present recombinant DNA pBlgGPA/D364stp. 

FIG. 11 is a restriction map of the present recombinant DNA pBlgHA/MUT1 . 

FIG. 12 is a restriction map of the present recombinant DNA pBlgHA/MUT2. 
FIG. 13 is a restriction map of the present recombinant DNA pBlgHA/MUT3.

FIG. 14 is a restriction map of the present recombinant DNA pBlgHA/MUT4. 

Explanations of the symbols are as follows:

The symbols "Eco RI", "Hin dIII", "Not I" and "Xho I" indicate cleavage sites by the restriction enzymes Eco RI,
Hin dIII, Not I and Xho I, respectively.
The symbols "D364stp", "HA/MUT1", "HA/MUT2", "HA/MUT3" and "HA/MUT4" indicate DNAs encoding the
present polypeptides.

The symbol "Ptac" indicates a Tac promoter.

The symbol "rrnBT1T2" indicates a region for transcriptional termination, derived from a ribosomal RNA operon.
The symbol "AmpR" indicates an ampicillin resistance gene.
The symbol "pBR322ori" indicates a replication origin in Escherichia coli.

The symbol "Ig sec" indicates a DNA encoding a polypeptide with a signal sequence for secretion of immunoglobulin.

The symbol "Emsv" indicates an enhancer from long terminal repeats of Moloney Mouse Sarcoma Virus.
The symbol "Pmti" indicates a promoter for the mouse metallothionein I gene.
The symbol "Poly (A)" indicates a polyadenylation signal derived from SV40 virus.

The symbol "BPV1" indicates a genome of a bovine papillomavirus.

The present inventors were the first in the world to isolate mammalian DNAs encoding L-asparaginases, from a guinea
pig and a human, and succeeded in elucidating their nucleotide sequences. The nucleotide sequences of the DNAs from
the guinea pig and the human are shown in SEQ ID NOs:15 and 16, respectively. This information is disclosed in
Japanese Patent Application No.42,564/95 (Japanese Patent Kokai No.214,885/96) by the same applicant as this
application. The present invention has been made based on the above information, and provides polypeptides which
originate from mammals and have L-asparaginase activity.

The polypeptides of the present invention are not restricted to particular sources or origins so far as they
originate from mammals and have L-asparaginase activity. The polypeptides are usually obtainable by the expression
of genes originating from mammals, and usually contain the amino acid sequences of SEQ ID NOs:1 to 3, wherein the
symbol "Xaa" in SEQ ID NO:3 means "glutamine" or "arginine". For example, the polypeptides have any one of the amino
acid sequences of SEQ ID NOs:4 to 9. In view of the technical level in this field, one or more amino acid residues
in SEQ ID NOs:4 to 9 can be substituted relatively easily by different ones without substantial loss of the activity.
Despite being derived from the same DNA, a variety of polypeptides with L-asparaginase activity may be obtained as
a result of modifications by endogenous enzymes of the hosts after the DNA expression or modifications during
purification of the polypeptides, depending on the types of vectors and hosts used to obtain transformants or on the
culturing conditions of the transformants, such as ingredients, compositions, temperatures or pHs. The wording "a
variety of polypeptides" includes polypeptides with deletions and/or additions of one or more amino acids at the N-
and/or C-termini thereof, or with glycosylations. In view of these, the present polypeptides include not only the
polypeptides with any amino acid sequence of SEQ ID NOs:4 to 9 but also their homologues so long as they have
L-asparaginase activity. The present polypeptides express the activity when they exist as multimers, preferably
tetramers.

The polypeptides of the present invention can usually be prepared by recombinant DNA techniques. In general,
the polypeptides are obtainable by culturing transformants containing DNAs encoding the polypeptides and collecting
the produced polypeptides from the resultant cultures. The transformants are obtainable by introducing into
appropriate hosts recombinant DNAs which contain any one of the nucleotide sequences of SEQ ID NOs:10 to 15 and a
self-replicable vector. One or more nucleotides in SEQ ID NOs:10 to 15 can be substituted by different nucleotides
without substantially changing the encoded amino acid sequences, owing to the degeneracy of the genetic code. To
facilitate the expression of the DNA in the hosts, one or more nucleotides in the nucleotide sequences which encode
the polypeptides or their homologues can be appropriately substituted by different ones. Furthermore, nucleotide
sequences which encode and/or do not encode one or more amino acids can be added to the 5'- and/or 3'-termini of the
nucleotide sequences.
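
The degeneracy of the genetic code mentioned above can be illustrated with a short sketch. The fragment, its synonymous variant and the helper `translate` are illustrative assumptions and are not taken from SEQ ID NOs:10 to 15; the codon table lists only the standard-code codons needed for this example.

```python
# Partial standard genetic code: only the codons used in this illustration.
CODON_TABLE = {
    "ATG": "M",
    "GCG": "A", "GCC": "A", "GCA": "A", "GCT": "A",
    "CGC": "R", "CGT": "R",
    "TCA": "S", "AGC": "S",
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon using CODON_TABLE."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Hypothetical 15-nucleotide fragment and a variant built from synonymous codons.
original = "ATGGCGCGCGCATCA"
variant = "ATGGCCCGTGCTAGC"

# The nucleotide sequences differ, but the encoded peptide is the same.
same_peptide = translate(original) == translate(variant)
```

The two strings differ at several nucleotide positions yet encode the same peptide, which is the sense in which nucleotides can be substituted without changing the encoded amino acid sequence.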

The DNAs encoding the polypeptides of this invention include those from natural sources and those synthesized
artificially so far as the polypeptides expressed by them have L-asparaginase activity. The DNAs can be wild-type
ones, containing the same nucleotide sequences as those from natural sources, or can be their homologues.

Examples of the wild-type DNAs include DNAs containing the nucleotide sequence of SEQ ID NO:15. The wild-type
DNA is obtainable from natural sources such as guinea pig livers, as disclosed in Japanese Patent Application
No.42,564/95 (Japanese Patent Kokai No.214,885/96) by the same applicant as this invention, by (a) constructing a cDNA
library by applying usual methods to purified poly (A)+ RNAs from a guinea pig or human liver as materials, (b) applying
the plaque hybridization method to the cDNA library using, as probes, oligonucleotides synthesized chemically based
on partial amino acid sequences of L-asparaginase purified from guinea pig serum, (c) collecting phage clones
containing the DNAs encoding the polypeptides of this invention, and (d) manipulating the collected phage clones in
a conventional manner. The wild-type DNA can also be synthesized chemically based on SEQ ID NO:15.

Examples of DNA homologues of the wild-type ones include DNAs containing any nucleotide sequence of SEQ
ID NOs:10 to 14. DNA homologues containing the nucleotide sequence of SEQ ID NO:10 are obtainable by applying
conventional methods in this field, such as the PCR method and methods for site-directed mutagenesis, to the wild-type
DNA of SEQ ID NO:15 according to the desired sequence. DNA homologues containing any nucleotide sequence of
SEQ ID NOs:11 to 14 are obtainable by methods such as the following: First, a wild-type DNA with the nucleotide
sequence of SEQ ID NO:16 is obtained by the methods disclosed in Japanese Patent Application No.42,564/95
(Japanese Patent Kokai No.214,885/96) by the same applicant as this invention, i.e., by screening a human liver cDNA
library. Subsequently, the wild-type DNA is subjected to the conventional methods mentioned above according to the
desired sequences to obtain the DNA homologues. The DNA homologues can also be synthesized chemically based on the
nucleotide sequences of SEQ ID NOs:10 to 14.

The present DNAs can generally be introduced into hosts in the form of recombinant DNAs. In general, each
recombinant DNA comprises one of the present DNAs and a self-replicable vector. The recombinant DNAs can be
easily prepared by general recombinant DNA techniques when the DNAs are available. Examples of such
self-replicable vectors include pKK223-3, pGEX-2T, pRL-λ, pBTrp2 DNA, pUB110, YEp13, Ti plasmid, Ri plasmid, pBI121,
pCDM8, pBPV and BCMGSneo. Among these vectors, pKK223-3, pGEX-2T, pRL-λ, pBTrp2 DNA and pUB110 are suitably
used to express the present DNAs in prokaryotic cells such as Escherichia coli and Bacillus sp., while YEp13, Ti
plasmid, Ri plasmid, pBI121, pCDM8, pBPV and BCMGSneo are suitably used to express the present DNAs in
eukaryotic cells such as yeasts and animal and plant cells.

To insert the present DNAs into the vectors, conventional methods in this field can be arbitrarily used. Examples
of such methods comprise the steps of (a) cleaving self-replicable vectors with restriction enzymes, (b) introducing the
same cleavage sites, for the same restriction enzymes as used to cleave the vectors, to the 5'- and 3'-termini of the
present DNAs by applying polymerase chain reaction to form double-stranded DNAs, (c) cleaving the double-stranded
DNAs with the restriction enzymes, and (d) ligating the cleaved vectors with the cleaved DNAs by the action of DNA
ligases. The recombinant DNAs thus obtained can be easily introduced into appropriate hosts, resulting in unlimited
replication of the DNAs by culturing the transformants.

The recombinant DNAs according to the present invention can be introduced into appropriate hosts such as
Escherichia coli, Bacillus sp., actinomycetes, yeasts and plant and animal cells. To introduce the DNAs into
Escherichia coli, the cells can be cultured in the presence of the recombinant DNAs and calcium ions. To introduce
them into Bacillus sp., competent cell methods or protoplast methods can be used. To introduce them into animal cells,
DEAE-dextran methods or electroporation methods can be used. Desired transformants can be cloned by applying
hybridization methods or by selecting L-asparaginase-producing cells from the cultures.

The transformants thus obtained produce the present polypeptides intracellularly or extracellularly when cultured
in nutrient culture media. Such media are usually liquid nutrient culture media which generally contain
carbon sources, nitrogen sources and minerals, and further contain micronutrients such as amino acids and/or vitamins
as required. The carbon sources usable in the present invention include saccharides such as starch, starch
hydrolysates, glucose, fructose and sucrose. The nitrogen sources usable in the present invention include organic and
inorganic compounds containing nitrogen, such as ammonia and its salts, urea, nitrates, peptone, yeast extract,
defatted soy bean, corn steep liquor and beef extract. Cultures containing the present polypeptides can be obtained by
inoculating the transformants into the above media and culturing them at temperatures of 25-65°C and pHs of 5-8 for
about 1-10 days under aerobic conditions by an aeration-agitation method, etc.

The cultures can be used intact as agents for susceptive diseases. However, the cultures are usually treated with
ultrasonication or cell wall lytic enzymes to disrupt the cells, and the present polypeptides are separated from the
cell-disruptants by techniques such as filtration and centrifugation and then purified. Alternatively, the
polypeptides can be purified from the culture supernatants obtained by removing cells from the cultures by filtration
or centrifugation, etc. The present polypeptides can be purified by applying techniques generally used in this field
for protein purification, such as salting out, dialysis, filtration, concentration, gel filtration chromatography,
ion-exchange chromatography, affinity chromatography, hydrophobic chromatography, isoelectric focusing and gel
electrophoresis, and if necessary, two or more of them can be applied in combination to the supernatants which are
separated from the insoluble substances of the cell-disruptants, or to the culture supernatants. The resultant
purified polypeptide solutions can be concentrated and/or lyophilized into liquids or solids depending on their final
uses.

The following experiments explain the present invention in more detail, and the techniques used therein are
conventional ones in this field: For example, the techniques are disclosed by J. Sambrook et al. in "Molecular Cloning, A
Laboratory Manual", 2nd edition (1989), published by Cold Spring Harbor Laboratory Press, New York, U.S.A., and by
Masami MATSUMURA in "Laboratory Manual for Genetic Engineering" (1988), published by Maruzen Co., Ltd., Tokyo,
Japan.

Experiment 1 

Expression of wild-type DNA

Experiment 1-1 

Expression of guinea pig wild-type DNA 


Experiment 1-1 (a) 

Preparation of guinea pig wild-type DNA 

A guinea pig wild-type DNA encoding L-asparaginase was prepared by the method disclosed in Japanese Patent
Kokai No.214,885/96 by the same applicant as this invention. The DNA had the nucleotide sequence of SEQ ID NO:15.
A DNA having the polypeptide-encoding region in SEQ ID NO:15, i.e., the sequence of nucleotides
20-1,714 in SEQ ID NO:15, is called "GPA/WT DNA" hereinafter, and the expression product thereof with the amino
acid sequence of SEQ ID NO:15 is called "guinea pig wild-type L-asparaginase". SEQ ID NO:17 shows in parallel the
nucleotide sequence of GPA/WT DNA and the amino acid sequence encoded thereby.

Experiment 1-1(b) 

Preparation of recombinant DNA 



Ten µl of 10 x PCR buffer, one µl of 25 mM dNTP mix, and one ng of the guinea pig wild-type DNA obtained in
Experiment 1-1(a), as a template, were placed in a 0.5 ml reaction tube. The mixture was mixed with adequate amounts
of oligonucleotides, as sense and anti-sense primers, chemically synthesized based on the amino acid sequences near
the N- and C-termini of SEQ ID NO:15, brought up with sterilized distilled water to a total volume of 99.5 µl, and
mixed with 0.5 µl of 2.5 units/µl AmpliTaq DNA polymerase. The nucleotide sequence of the sense primer was
5'-AATCTCGAGCCACCATGGCGCGCGCATCA-3', a nucleotide sequence obtained by adding a nucleotide sequence common
in animal cells, as shown by M. Kozak in "Nucleic Acids Research", Vol.15, pp.8,125-8,148 (1987), upstream
of a region which encodes the N-terminal amino acid sequence of SEQ ID NO:15 and then adding, further
upstream, a cleavage site for the restriction enzyme Xho I. The nucleotide sequence of the anti-sense primer was
5'-CTGCGGCCGCTTATCAGATGGCAGGCGGCAC-3', the complement of a nucleotide sequence obtained by adding
two termination codons downstream of a region which encodes the C-terminus of the amino acid sequence of
SEQ ID NO:15 and adding a cleavage site for the restriction enzyme Not I further downstream. The resulting
mixture was successively incubated at 94°C for one min, at 55°C for one min, and at 72°C for 3 min, and this series
of incubations was repeated 40 times for PCR to amplify the DNA. Thus, a DNA containing GPA/WT DNA was obtained
and then cleaved with the restriction enzymes Xho I and Not I to obtain a DNA fragment of about 1.7 kbp. Twenty-five ng
of the DNA fragment was mixed with 10 ng of a plasmid vector, "pCDM8", commercialized by Invitrogen
Corporation, San Diego, U.S.A., which had been cleaved with the restriction enzymes Xho I and Not I. To the DNA mixture
thus obtained was added an equal volume of solution I in "LIGATION KIT VERSION 2" commercialized by Takara
Shuzo, Tokyo, Japan, and the mixture was incubated at 16°C for 2 hours to obtain a replicable recombinant DNA, "pCGPA/WT".
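
The design of the two primers can be checked with a short sketch. It is illustrative only: the recognition sequences of Xho I (CTCGAG) and Not I (GCGGCCGC) and the Kozak context CCACC immediately upstream of ATG are standard values supplied here rather than quoted from the text, and the helper names are hypothetical.

```python
XHO_I = "CTCGAG"        # Xho I recognition sequence (assumed standard value)
NOT_I = "GCGGCCGC"      # Not I recognition sequence (assumed standard value)
KOZAK = "CCACCATGG"     # Kozak context around the start codon (assumed)
STOP_CODONS = {"TAA", "TAG", "TGA"}

# Primer sequences as quoted in Experiment 1-1(b), written 5' to 3'.
SENSE = "AATCTCGAGCCACCATGGCGCGCGCATCA"
ANTISENSE = "CTGCGGCCGCTTATCAGATGGCAGGCGGCAC"


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    comp = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(comp[base] for base in reversed(seq))


def stops_before(seq: str, pos: int) -> int:
    """Count consecutive stop codons that end exactly at index pos."""
    count = 0
    while pos >= 3 and seq[pos - 3:pos] in STOP_CODONS:
        count += 1
        pos -= 3
    return count


# Sense primer: Xho I site, then the Kozak context containing ATG.
xho_pos = SENSE.find(XHO_I)
kozak_pos = SENSE.find(KOZAK)

# Anti-sense primer: inspect the strand it is complementary to.
coding_3prime = reverse_complement(ANTISENSE)
not_pos = coding_3prime.find(NOT_I)
n_stops = stops_before(coding_3prime, not_pos)
```

Under these assumptions the sense primer carries the Xho I site ahead of the start codon, and the strand complementary to the anti-sense primer carries two termination codons directly followed by the Not I site, in line with the description above.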

The recombinant DNA pCGPA/WT was introduced into an Escherichia coli MC1061/P3 strain, commercialized by
Invitrogen Corporation, San Diego, U.S.A., by the competent cell method. The transformant thus obtained was inoculated
into L broth medium (pH 7.2) containing 20 µg/ml ampicillin and 10 µg/ml tetracycline, followed by cultivation at 37°C
for 18 hours under shaking conditions. The transformants were collected from the culture by centrifugation and
subjected to the conventional alkali-SDS method to extract the recombinant DNA pCGPA/WT. Analysis of the pCGPA/WT
by an automatic sequencer equipped with a fluorophotometer confirmed that it contained GPA/WT DNA, that the
termination codons were ligated to its 3'-terminus, and that it was ligated downstream of a CMV promoter in the
direction from the 5'- to the 3'-terminus.

The system using COS-1 (ATCC CRL-1650), a cell line derived from a monkey kidney, as a host was
used to express the DNA in the following Experiments 1 and 2. Since the system is for transient expression, it has
the disadvantage that DNAs introduced into transformants are not stable over several days, and the transformants
do not produce the desired polypeptides repeatedly. However, it is known that the number of copies of the desired
DNA per cell temporarily increases to 10^5 when plasmid vectors having a replication origin derived from SV40 virus,
such as the above-mentioned pCDM8, are introduced into COS-1 cells. From this point of view, the system has the
merit that the product of expression of the desired DNA can be analyzed quite easily.

Experiment 1-1(c) 

Recombinant DNA expression in COS-1 cell

In accordance with the DEAE-dextran method reported by Frederick M. Ausubel et al. in "Current Protocols in
Molecular Biology" (1987), chapters 9.2.1-9.2.3 and 9.2.5-9.2.6, published by John Wiley and Sons Inc., New York,
U.S.A., the recombinant DNA pCGPA/WT in Experiment 1-1(b) was introduced into COS-1 cells for its expression. To
each well of "3046", a plastic multiwell plate with 6 wells of 3.5 cm diameter, commercialized by Becton Dickinson
Labware, New Jersey, U.S.A., was added 2.5 ml of DME medium containing 10 v/v % bovine fetal serum and 1.8 x
10^5 COS-1 cells. The cells were cultured at 37°C in a 5 v/v % CO2 incubator overnight. After removing the culture
supernatant with an aspirator and washing the remaining cells with DME medium containing 50 mM Tris-HCl buffer (pH
7.4), each well was charged with 2.5 ml of DME medium containing 2.8 µg/ml pCGPA/WT, 50 mM Tris-HCl (pH 7.4),
0.4 mg/ml DEAE-dextran and 0.1 mM chloroquine, and incubated at 37°C for 4 hours in a 5 v/v % CO2 incubator.
Thereafter, the culture supernatant was removed, and the remaining cells in each well were treated with 2.5 ml of 10
mM phosphate buffered saline (hereinafter abbreviated as "PBS") containing 10 v/v % DMSO and incubated at
ambient temperature for 2 minutes. After removing the supernatant and washing the remaining cells with DME medium
containing 50 mM Tris-HCl (pH 7.4), each well was charged with 2.5 ml of "COS MEDIUM", commercialized by COSMO
BIO CO. LTD., Tokyo, Japan, followed by cultivation at 37°C for 3 days in a 5 v/v % CO2 incubator to express the
desired DNA. As a control, the same experiment was carried out using the plasmid vector pCDM8.

After 3 days' cultivation, the multiwell plates with the cultures were subjected three times to a treatment of
freezing at -80°C and thawing at ambient temperature to disrupt the cells. The whole cultures were transferred to
centrifuge tubes and centrifuged to precipitate and remove insoluble components, followed by obtaining total soluble
fractions, concentrating the fractions using membranes, and adjusting the volume of the total soluble fraction per
well to 0.5 ml for the following analyses.

Experiment 1-1(d) 

Assay for L-asparaginase activity

L-Asparaginase activity was expressed in units assayed as follows: Samples were placed in 1.5 ml reaction
tubes in 50 µl each and admixed with 200 µl of 50 mM phosphate buffer (pH 7.0) containing 1.4 mg/ml L-asparagine.
After standing at 37°C for 0, 1, 2, 4, 6 and 16 hours, L-aspartic acid in the reaction mixtures was quantified by an
amino acid analyzer. In parallel, 1.0, 0.5 and 0.25 unit/ml dilutions of an L-asparaginase from Escherichia coli were
provided and quantified for L-aspartic acid after incubating at 37°C for 0 and one hour, and based on the increased
amount of L-aspartic acid, a calibration curve was drawn. By plotting on the calibration curve the increased amounts
of L-aspartic acid of the samples, the samples' L-asparaginase activities were estimated. The activity of samples with
a lower activity was estimated based on that assayed after 2 hours or more of incubation. One unit of L-asparaginase
activity was defined as the amount that releases one µmol of ammonia from L-asparagine per minute under the above
conditions.
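
The calibration step can be sketched as a straight-line fit. Everything numerical below is hypothetical: the L-aspartic acid increases for the standards and the sample are placeholder values in arbitrary units, not measurements from this experiment, and dividing a longer incubation by its duration assumes that L-aspartic acid release is linear in time, which the text does not state.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def estimate_activity(asp_increase, hours, slope, intercept):
    """Estimate activity (unit/ml) from an L-aspartic acid increase.

    The increase is normalized to one hour (assumed linear in time) so that
    it is comparable with the one-hour incubation of the standards.
    """
    per_hour = asp_increase / hours
    return (per_hour - intercept) / slope


# Standards as in the text (unit/ml) with HYPOTHETICAL one-hour increases.
standard_activity = [0.25, 0.5, 1.0]
standard_increase = [25.0, 50.0, 100.0]   # placeholder values, arbitrary units

slope, intercept = fit_line(standard_activity, standard_increase)

# HYPOTHETICAL low-activity sample read after 4 hours of incubation.
sample_activity = estimate_activity(40.0, 4.0, slope, intercept)
```

With these placeholder numbers the sample works out to roughly 0.1 unit/ml; real data would replace the placeholder lists.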

The total soluble fractions obtained in Experiment 1-1(c) were treated similarly as above, and their
activities were expressed as the total L-asparaginase activities detected in the soluble fractions from 1.8 x 10^5
COS-1 cells. As a result, the activity of the total soluble fraction in Experiment 1-1(c) was 0.083 unit, and the
control gave no activity.

Experiment 1-1(e)

Western blotting 



An anti-L-asparaginase antibody was prepared as follows: An oligopeptide with the sequence
Gly-Ser-Gly-Asn-Gly-Pro-Thr-Lys-Pro-Asp-Leu-Leu-Gln-Glu-Leu-Arg-Cys was synthesized chemically in a usual manner.
Keyhole limpet hemocyanin was linked to the C-terminus of the oligopeptide. The resultant was purified and used to
immunize rabbits in a usual manner. The rabbits were immunized 6 times at intervals of about 2 weeks, then the whole
blood was collected and subjected to salting out with 50 w/v % ammonium sulfate to obtain an anti-L-asparaginase
anti-serum.

In accordance with the method reported by U. K. Laemmli et al. in "Nature", Vol.227, pp.680-685 (1970), 0.2 ml of
the total soluble fraction in Experiment 1-1(c) was subjected to 12.5 w/v % SDS-polyacrylamide gel electrophoresis
(hereinafter abbreviated as "SDS-PAGE"). The migrated polypeptides were transferred to a nitrocellulose membrane
and subjected to Western blotting using the above anti-L-asparaginase anti-serum, in accordance with the method
reported by H. Towbin in "Proceedings of the National Academy of Sciences of the U.S.A.", Vol.76, pp.4,350-4,354
(1979). For color development, an alkaline phosphatase system was used. By comparison with the control and molecular
weight markers, both the identification of bands specifically stained in the sample and the measurement of the
molecular weight of each subunit of the L-asparaginase were carried out. The molecular weight markers used were bovine
serum albumin (67 kDa), ovalbumin (45 kDa), soy bean trypsin inhibitor (20.1 kDa) and α-lactalbumin (14.4 kDa), and
were stained with amido black. The total soluble fraction in Experiment 1-1(c) gave no clear band.


Experiment 1-1(f)

Measurement of molecular weight on gel filtration 

Two ml of the total soluble fraction in Experiment 1-1(c) was subjected to gel filtration column chromatography
using "HILOAD SUPERDEX 200 COLUMN", with an inner diameter of 16 mm and a length of 60 cm, commercialized
by Pharmacia LKB Biotechnology AB, Uppsala, Sweden, equilibrated with PBS. Based on the L-asparaginase activity
of the eluted fractions, the molecular weight of the guinea pig wild-type L-asparaginase in a native form was examined.
The molecular weight markers used were thyroglobulin (699 kDa), ferritin (440 kDa), catalase (232 kDa), aldolase (158
kDa), bovine serum albumin (67 kDa) and ovalbumin (43 kDa). The peak of L-asparaginase activity in the eluted
fractions was observed at a position corresponding to a molecular weight of about 300 kDa.

Since no clear band was detected by Western blotting, the molecular weight of the wild-type L-asparaginase in a
dissociated form could not be determined, while the molecular weight in a native form was estimated to be about 300
kDa based on the result of gel filtration. The molecular weights, in native and dissociated forms, of L-asparaginase
purified from guinea pig serum were respectively estimated to be about 190 kDa on gel filtration
and about 43 kDa on SDS-PAGE. As disclosed in Japanese Patent Kokai No.214,885/96 by the same applicant as the
present invention, 3 partial amino acid sequences of the guinea pig serum L-asparaginase were observed in a region
of amino acids 10-236 in the sequence of guinea pig wild-type L-asparaginase. Meanwhile, two consensus amino acid
sequences essential for the expression of L-asparaginase activity, i.e., SEQ ID NOs:1 and 2, as proposed by E. Harms
in "FEBS Letters", Vol.285, pp.55-58 (1991) based on the results of experiments on L-asparaginase derived from
Escherichia coli, correspond to the sequences of amino acids 16-19 and 114-118 in the amino acid sequence of the
guinea pig wild-type L-asparaginase. In view of these and the results in Experiment 1-1, the present inventors
estimated that the guinea pig wild-type L-asparaginase may require a region of about amino acids 1-400 in the amino
acid sequence to express the activity. In Experiment 2-1, to examine the L-asparaginase activities of C-terminal
defective mutants as homologues of the guinea pig wild-type L-asparaginase, the expression products of DNA homologues
from a guinea pig were tested for properties and features.

Experiment 1-2 

Expression of human wild-type DNA

A human wild-type DNA encoding L-asparaginase was prepared according to the method in Japanese Patent
Kokai No.214,885/96 by the same applicant as the present invention. The DNA had the nucleotide sequence of SEQ
ID NO:16. Hereinafter, a DNA having the polypeptide-encoding region in SEQ ID NO:16, i.e., the sequence of nucleotides
93-1,811 in SEQ ID NO:16, is called "HA/WT DNA", and a polypeptide, as the expression product of HA/WT DNA,
having the amino acid sequence of SEQ ID NO:16, may be called "human wild-type L-asparaginase". SEQ ID NO:18
shows the nucleotide sequence of HA/WT DNA and the amino acid sequence encoded thereby.

Except for the template and the sense- and anti-sense-primers, PCR was performed under the same conditions
as used in Experiment 1-1(b). As a template, the human wild-type DNA in Experiment 1-2 was used. As the sense- and
anti-sense-primers, oligonucleotides with sequences of 5'-AATCTCGAGCCACCATGGCGCGCGCGGTG-3' and
5'-CTGCGGCCGCTTATCAGACACCAGGCAGCAC-3' were respectively used. The amplified DNA was then
treated by the same method as used in Experiment 1-1(b) to prepare a recombinant DNA, "pCHA/WT". After
sequencing, the pCHA/WT was introduced into COS-1 cells and expressed, and the expression product was analyzed
similarly as in Experiment 1-1.

In contrast to the guinea pig wild-type L-asparaginase, the experimental system could not detect the activity of the
human wild-type L-asparaginase. This was presumably because the human wild-type L-asparaginase had a lower specific
activity than the guinea pig wild-type one, which prompted the examination of the properties of the expression
products of DNA homologues from human in Experiment 2-2.

Experiment 2 

Expression of DNA homologue 


Experiment 2-1 

Expression of DNA homologue originating from guinea pig 

A termination codon was substituted for a codon at a specific position of the guinea pig wild-type
DNA to obtain a DNA homologue: A DNA was obtained by the PCR method by replacing the codon at
nucleotides 1,090-1,092 or 1,012-1,014 in SEQ ID NO:17 with a termination codon. Except for the nucleotide
sequence of the anti-sense primer, PCR was performed under the same conditions as used in Experiment 1-1(b). As the
anti-sense primer, an oligonucleotide with a sequence of 5'-CTGCGGCCGCTTATCATGCCGTGGGCAGTGT-3' or
5'-CTGCGGCCGCTTATCAGCCCAACACGTAGGA-3' was used to prepare the two types of DNAs. The amplified DNAs were
treated similarly as in Experiment 1-1(b) to obtain recombinant DNAs, "pCGPA/D364stp" and "pCGPA/L338stp". By
sequencing similarly, it was confirmed that pCGPA/D364stp and pCGPA/L338stp had DNAs encoding the sequences of
amino acids 1-363 and 1-337 in the guinea pig wild-type L-asparaginase, respectively, and had a termination codon at
their 3'-termini free of intervening sequences. Hereinafter, the polypeptide-encoding regions of the DNAs are
respectively named "GPA/D364stp DNA" and "GPA/L338stp DNA". GPA/D364stp DNA and GPA/L338stp DNA were ligated
downstream of a CMV promoter in the direction from the 5'- to 3'-termini. The expression products of the DNAs may be
named "guinea pig L-asparaginase homologues".

The above recombinant DNAs were introduced into COS-1 cells and examined similarly as in Experiment 1-1. As
controls, pCGPA/WT and pCDM8 in Experiment 1-1(b) were similarly treated and examined. Table 1 shows the results.
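
The names D364stp and L338stp follow from simple codon arithmetic, which the following minimal sketch illustrates. It assumes that nucleotide 1 of the coding sequence in SEQ ID NO:17 is the first base of the initiation codon (an assumption that is consistent with the homologue names but is not stated in the text). The helper names `codon_to_residue` and `stop_codons_before_site` are hypothetical. The sketch also reads the two anti-sense primers of Experiment 2-1 back on the sense strand to list the termination codons lying immediately upstream of the GCGGCCGC motif, which is the recognition sequence of Not I.

```python
"""Codon arithmetic for the C-terminal truncation homologues (illustrative sketch)."""
from __future__ import annotations

STOP_CODONS = {"TAA", "TAG", "TGA"}
NOT_I_SITE = "GCGGCCGC"  # Not I recognition sequence; it is its own reverse complement
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Anti-sense primers quoted in Experiment 2-1 (written 5' to 3').
PRIMER_D364STP = "CTGCGGCCGCTTATCATGCCGTGGGCAGTGT"
PRIMER_L338STP = "CTGCGGCCGCTTATCAGCCCAACACGTAGGA"


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def codon_to_residue(first_nt: int, cds_start: int = 1) -> int:
    """Map the first nucleotide of a codon to its 1-based residue number.

    cds_start=1 is an assumption: the coding sequence is taken to begin at nucleotide 1.
    """
    offset = first_nt - cds_start
    if offset < 0 or offset % 3:
        raise ValueError("position is not the first base of an in-frame codon")
    return offset // 3 + 1


def stop_codons_before_site(antisense_primer: str, site: str = NOT_I_SITE) -> list[str]:
    """List the consecutive stop codons that precede `site` on the sense strand."""
    sense = reverse_complement(antisense_primer)
    end = sense.find(site)
    if end < 0:
        raise ValueError("restriction site not found in primer")
    stops: list[str] = []
    while end >= 3 and sense[end - 3:end] in STOP_CODONS:
        stops.insert(0, sense[end - 3:end])
        end -= 3
    return stops


if __name__ == "__main__":
    for first_nt in (1090, 1012):
        print(first_nt, "-> residue", codon_to_residue(first_nt))
    for primer in (PRIMER_D364STP, PRIMER_L338STP):
        print(stop_codons_before_site(primer))
```

Under the stated assumption, nucleotides 1,090-1,092 and 1,012-1,014 correspond to residues 364 and 338, so the encoded polypeptides end at residues 363 and 337, in agreement with the sequencing result reported above.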

As shown in Table 1, the activities of the expression products of GPA/WT DNA and GPA/D364stp DNA were
detected, but that of GPA/L338stp DNA was not. These results suggest that a region of amino acids 1-363 in the
guinea pig wild-type L-asparaginase may be enough for sufficiently expressing the L-asparaginase activity. This amino
acid sequence, amino acids 1-363 in the guinea pig wild-type, is SEQ ID NO:4, and a nucleotide sequence which encodes
the amino acid sequence is SEQ ID NO:10. The amino acid sequence of the guinea pig wild-type L-asparaginase is SEQ
ID NO:5.

Experiment 2-2 


Expression of DNA homologue originating from human 

DNA homologues were prepared by replacing specific codons in the human wild-type DNA with termination codons
or codons for different amino acids: The termination-codon homologue was prepared by replacing the codon at
nucleotides 1,096-1,098 in SEQ ID NO:18 with a termination codon by the PCR method. Except for the template and the
sense- and anti-sense-primers, PCR was performed under the same conditions as used in Experiment 1-1(b). As a
template, the human wild-type DNA in Experiment 1-2 was used. As the sense- and anti-sense-primers, the
oligonucleotides with sequences of 5'-AATCTCGAGCCACCATGGCGCGCGCGGTG-3' and
5'-CTGCGGCCGCTCATTACACCGAGGGTGGCGT-3' were respectively used. The amplified DNA was treated similarly as in
Experiment 1-1 to obtain a recombinant DNA, "pCHA/E366stp", and sequenced. It was confirmed that pCHA/E366stp
contained a DNA encoding amino acids 1-365 in SEQ ID NO:16 and a termination codon at the 3'-terminus free of
intervening sequences. The polypeptide-encoding region is named "HA/E366stp DNA" hereinafter. HA/E366stp DNA was
ligated downstream of a CMV promoter in the direction from the 5'- to 3'-termini.

To change specific codons in DNAs into ones for different amino acids, the overlap extension method reported
by Robert M. Horton et al. in "Methods in Enzymology", Vol.217, pp.270-279 (1993), published by Academic Press,
Inc., San Diego, U.S.A., was used. The method is summarized in FIG.1 and explained as follows: First, mutagenic
primers A and B, in which the nucleotides to be mutagenized were substituted by the desired different ones and which
were complementary to one another, were prepared. The mutagenic primer A was a sense strand, and the mutagenic
primer B was an anti-sense strand. A set of 5'- and 3'-terminal primers, which amplify the whole region of the desired
DNA, was prepared, and they were respectively sense and anti-sense strands. Second, conventional PCR was performed
using the 5'-terminal primer, the mutagenic primer A, and, as a template, a DNA with the original nucleotide sequence.
In parallel, another PCR was performed using the same DNA as a template, the 3'-terminal primer, and the mutagenic
primer B. These two PCRs were named "first step PCRs". Third, the two DNAs amplified in the first step PCRs were
mixed with the 5'- and 3'-terminal primers as used in the first step PCRs, followed by performing PCR as a "second
step PCR". In the second step PCR, the two DNA fragments amplified in the first step PCRs served as primers and
templates to generate mutagenized DNAs, while the 5'- and 3'-terminal primers served as primers to amplify the
mutagenized DNAs. By this method, DNAs into which 7 types of nucleotide substitutions were introduced, i.e., 7 DNA
homologues, were prepared. The 7 types of nucleotide substitutions and the consequent changes of the encoded amino
acid sequences are summarized in Table 2. The template DNAs and mutagenic primers A and B used to prepare the 7 DNA
homologues are summarized in Table 3. The 5'- and 3'-terminal primers were respectively the same as the sense- and
anti-sense-primers used to prepare pCHA/E366stp in Experiment 2-2.
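
The two-step logic of the overlap extension method can be illustrated with a string-level sketch. It is only a toy model, not the experimental protocol: the template is a short hypothetical sequence, all products are written as sense strands, and the function names are assumptions. Because a PCR needs one sense and one anti-sense primer to yield a product, the sketch pairs the 5'-terminal primer with the anti-sense mutagenic primer and the 3'-terminal primer with the sense mutagenic primer; this pairing is an assumption of the sketch.

```python
"""Toy string model of codon replacement by overlap extension PCR."""
from __future__ import annotations

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def mutagenic_primers(template: str, pos: int, new_codon: str, flank: int = 9) -> tuple[str, str]:
    """Build the complementary mutagenic primer pair around the codon at 0-based `pos`.

    Returns (sense primer, anti-sense primer); both carry `new_codon`.
    """
    if len(new_codon) != 3:
        raise ValueError("new_codon must be three nucleotides long")
    if pos < flank or pos + 3 + flank > len(template):
        raise ValueError("codon is too close to the template ends for this flank length")
    sense = template[pos - flank:pos] + new_codon + template[pos + 3:pos + 3 + flank]
    return sense, reverse_complement(sense)


def overlap_extension(template: str, pos: int, new_codon: str, flank: int = 9) -> str:
    """Return the sense strand of the mutagenized full-length product."""
    sense_primer, antisense_primer = mutagenic_primers(template, pos, new_codon, flank)

    # First step PCRs, each product written as its sense strand.
    # Upstream fragment: 5'-terminal primer x anti-sense mutagenic primer.
    upstream = template[:pos - flank] + reverse_complement(antisense_primer)
    # Downstream fragment: sense mutagenic primer x 3'-terminal primer.
    downstream = sense_primer + template[pos + 3 + flank:]

    # Second step PCR: the two fragments anneal through the shared mutagenic
    # region, prime each other, and are extended to the full-length product.
    overlap = len(sense_primer)
    if upstream[-overlap:] != downstream[:overlap]:
        raise RuntimeError("fragments do not share the mutagenic overlap")
    return upstream + downstream[overlap:]


if __name__ == "__main__":
    # Hypothetical 45-nt template, used for illustration only.
    demo_template = "ATGGCGCGCGCGGTGGGCCCTGAACGTCGTCTGCTGGCAGTTTAC"
    print(overlap_extension(demo_template, 18, "CGT"))
```

The returned product is identical to the template except for the three nucleotides at the chosen position, which is the property that sequencing of each recombinant DNA is meant to confirm.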
The obtained DNA homologues from human were treated similarly as in Experiment 1-1 to obtain recombinant
DNAs "pCHA/MUT1", "pCHA/MUT2", "pCHA/MUT3", "pCHA/MUT4", "pCHA/MUT5", "pCHA/MUT6" and
"pCHA/MUT7". The expression products of the DNA homologues obtained in Experiment 2-2 may be named "human
L-asparaginase homologues" hereinafter. After sequencing, these DNA homologues were introduced into COS-1 cells,
followed by expression and assay. As controls, pCHA/WT obtained in Experiment 1-2 and pCDM8 were treated and
examined. Signal intensities of bands, detected by Western blotting, were evaluated by densitometry to compare
quantitatively the expressed products. The results are shown in Table 4.
The results in Table 4 indicate that the human L-asparaginases, both the wild type and the C-terminal defective
mutant, i.e., the expression product of HA/E366stp DNA as one of the homologues, had a lower specific activity
than that from guinea pigs. In addition, these results indicate that the specific activities of some of the point
mutants, in which some of the amino acids inherent to the human wild-type L-asparaginase were substituted by
different ones, increased to a detectable level. The human DNA homologues HA/MUT1, HA/MUT2, HA/MUT3
and HA/MUT5, whose expression products gave a detectable level of activity, have the nucleotide sequences of SEQ ID
NOs:11 to 14, respectively, and encode the amino acid sequences of SEQ ID NOs:6 to 9, respectively.

Based on the results in the above experiments, the present inventors found that polypeptides from mammal may
require the amino acid sequence of SEQ ID NO:3 (where the symbol "Xaa" means "glutamine" or "arginine") to express
a detectable level of L-asparaginase activity in the expression and assay systems in Experiments 1 and 2, in addition
to the conventionally known amino acid sequences of SEQ ID NOs:1 and 2. The amino acid sequence of the
guinea pig wild-type L-asparaginase contains SEQ ID NO:3 in the region of amino acids 298-302. Examples of
such polypeptides, having all the amino acid sequences of SEQ ID NOs:1 to 3, include those having SEQ ID NOs:4
and 5 from guinea pigs and those having SEQ ID NOs:6 to 9 from human.

Based on the above findings, the present inventors invented the polypeptides having L-asparaginase activity. The
following examples explain the present invention; the techniques used therein are conventional ones used in the
art and, of course, do not restrict the present invention.

Example A-1

Polypeptides having L-asparaginase activity

Example A-1(a)

Preparation of transformant

Ten µl of 10 x PCR buffer, one µl of 25 mM dNTP mix, one ng of the recombinant DNA pCGPA/WT obtained
in Experiment 1-1 as a template, and an adequate amount of oligonucleotides as sense- and anti-sense-primers
synthesized chemically based on the 5'- and 3'-terminal sequences of GPA/WT DNA were placed in a 0.5 ml reaction
tube. The mixture was mixed with sterilized distilled water to give a total volume of 99.5 µl, and 0.5 µl of 2.5 units/µl
AmpliTaq DNA polymerase was further added. The sequence of the sense primer was
5'-GCGAATTCATGGCGCGCGCATCA-3', which was a nucleotide sequence obtained by adding a cleavage site for a
restriction enzyme, Eco RI, upstream of the 5'-terminus of GPA/WT DNA. The sequence of the anti-sense primer was
5'-GCAAGCTTTCAGATGGCAGGCGGCAC-3', which was complementary to a nucleotide sequence prepared by adding a
termination codon to the 3'-terminus of GPA/WT DNA and then adding a cleavage site for a restriction enzyme,
Hin dIII, downstream. The above mixture was subjected to 40 cycles of successive incubations at 94°C for one min, at
55°C for one min, and at 72°C for 3 min to perform PCR. By cleaving the amplified DNA with restriction enzymes Eco RI
and Hin dIII, an Eco RI-Hin dIII fragment with a length of about 1.7 kbp was obtained. Twenty-five ng of the DNA was
mixed with 10 ng of plasmid vector "pKK223-3", commercialized by Pharmacia LKB Biotechnology AB, Uppsala, Sweden,
which had been cleaved by restriction enzymes Eco RI and Hin dIII, and then mixed with the solution I in "LIGATION
KIT VERSION 2" commercialized by Takara Shuzo Inc., Tokyo, Japan, in a volume equal to that of the DNA mixture,
followed by incubation at 16°C for 2 hours to obtain a replicable recombinant DNA, "pKGPA/WT".
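
The reaction composition and the layout of the two primers can be checked with a few lines of arithmetic. The sketch below is only illustrative and uses hypothetical names. It computes the final concentrations in the 100 µl reaction, the amount of polymerase, and the total programmed incubation time (ramp times are ignored), and it confirms that the initiation codon and the complement of a termination codon directly follow the Eco RI (GAATTC) and Hin dIII (AAGCTT) recognition sequences in the respective primers.

```python
"""Arithmetic check of the PCR mixture and primer layout in Example A-1(a) (sketch)."""
from __future__ import annotations

_COMPLEMENT = str.maketrans("ACGT", "TGCA")

TOTAL_UL = 99.5 + 0.5  # mixture made up with water, plus the polymerase


def final_concentration(stock: float, volume_ul: float, total_ul: float = TOTAL_UL) -> float:
    """Dilution formula: stock concentration x added volume / total volume."""
    return stock * volume_ul / total_ul


BUFFER_X = final_concentration(10.0, 10.0)  # 10 x buffer, 10 µl
DNTP_MM = final_concentration(25.0, 1.0)    # 25 mM dNTP mix, 1 µl
POLYMERASE_UNITS = 0.5 * 2.5                # 0.5 µl of 2.5 units/µl
CYCLING_MIN = 40 * (1 + 1 + 3)              # 40 cycles of 1 + 1 + 3 min

SENSE_PRIMER = "GCGAATTCATGGCGCGCGCATCA"
ANTISENSE_PRIMER = "GCAAGCTTTCAGATGGCAGGCGGCAC"
ECO_RI_SITE = "GAATTC"
HIN_DIII_SITE = "AAGCTT"


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def codon_after_site(primer: str, site: str) -> str:
    """Return the three nucleotides that directly follow `site` in `primer`."""
    start = primer.find(site)
    if start < 0:
        raise ValueError("restriction site not found in primer")
    end = start + len(site)
    return primer[end:end + 3]


if __name__ == "__main__":
    print("buffer (x):", BUFFER_X)
    print("dNTP mix (mM):", DNTP_MM)
    print("polymerase (units):", POLYMERASE_UNITS)
    print("programmed time (min):", CYCLING_MIN)
    print("after Eco RI site:", codon_after_site(SENSE_PRIMER, ECO_RI_SITE))
    stop = reverse_complement(codon_after_site(ANTISENSE_PRIMER, HIN_DIII_SITE))
    print("stop codon on sense strand:", stop)
```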

The recombinant DNA pKGPA/WT was introduced into an Escherichia coli strain "JM105" by the competent cell
method. The resulting transformant "J-GPA/WT" was inoculated into L broth medium (pH 7.2) containing 50 µg/ml
ampicillin and cultured at 37°C for 18 hours under shaking conditions. The transformants collected by centrifugation
from the culture were subjected to a conventional alkali-SDS method to extract the recombinant DNA pKGPA/WT. As
shown in FIG.2, analysis using an automatic sequencer equipped with a fluorophotometer revealed that GPA/WT DNA of
SEQ ID NO:17 was ligated downstream of a Tac promoter in the direction from the 5'- to 3'-termini. In addition, it was
confirmed that a termination codon was ligated downstream of GPA/WT DNA without intervening sequences.


Example A-1 (b) 
Production of polypeptide 

The transformant J-GPA/WT was inoculated into L broth medium (pH 7.2) containing 50 µg/ml ampicillin and
cultured at 37°C for 18 hours under shaking conditions to obtain a seed culture. Eighteen L of a fresh preparation of
the same medium was placed in a 30-L jar fermenter, inoculated with one v/v % of the seed culture, and cultured at
37°C under aeration-agitation conditions. The culture was incubated until the absorbance at a wavelength of 650 nm,
measured on a portion of the culture placed in a cuvette 1 cm in thickness, reached about 1.5, admixed with IPTG to
give a final concentration of 0.1 mM, and incubated for 5 hours. The cells centrifugally collected from the culture
were suspended in a mixture solution (pH 7.2) containing 139 mM NaCl, 7 mM Na2HPO4 and 3 mM NaH2PO4, and
ultrasonicated to disrupt the cells, followed by centrifuging the resultant to obtain a supernatant.
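
For reference, the amounts of salt needed for the suspension solution can be derived from the stated molarities. The sketch below is a hedged illustration: it assumes anhydrous salts (molar masses of 58.44, 141.96 and 119.98 g/mol for NaCl, Na2HPO4 and NaH2PO4), whereas the text does not say which hydrate forms were used, and the function name `grams_needed` is hypothetical. The pH would still have to be checked and adjusted to 7.2 after dissolution.

```python
"""Salt amounts for the cell suspension solution of Example A-1(b) (sketch)."""
from __future__ import annotations

# Molar masses in g/mol; anhydrous forms are an assumption of this sketch.
MOLAR_MASS_G_PER_MOL = {"NaCl": 58.44, "Na2HPO4": 141.96, "NaH2PO4": 119.98}

# Concentrations stated in the text, in mM.
CONCENTRATION_MM = {"NaCl": 139.0, "Na2HPO4": 7.0, "NaH2PO4": 3.0}


def grams_needed(volume_l: float) -> dict[str, float]:
    """Return grams of each salt required for `volume_l` litres of solution."""
    if volume_l <= 0:
        raise ValueError("volume must be positive")
    return {
        salt: CONCENTRATION_MM[salt] / 1000.0 * MOLAR_MASS_G_PER_MOL[salt] * volume_l
        for salt in CONCENTRATION_MM
    }


if __name__ == "__main__":
    for salt, grams in grams_needed(1.0).items():
        print(f"{salt}: {grams:.3f} g per litre")
```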

Ammonium sulfate was added to the supernatant under ice-chilling conditions to give a concentration of 50 w/v %
and then dissolved to homogeneity. After standing for several minutes, the precipitates were collected by centrifugation,
dissolved in 20 mM Tris-HCl buffer (pH 8.0), and dialyzed against a fresh preparation of the same buffer, followed by
applying the dialyzed solution to "Q SEPHAROSE FF COLUMN", commercialized by Pharmacia LKB Biotechnology
AB, Uppsala, Sweden, equilibrated with the same buffer. After washing sufficiently with the same buffer, the column
was fed with a linear gradient of NaCl increasing from 0 M to 0.5 M in 20 mM Tris-HCl buffer (pH 8.0). The
fractions eluted at about 0.1-0.3 M NaCl were collected, and the solvent was replaced with 10 mM sodium phosphate
buffer (pH 7.5) while concentrating with membranes. The concentrated solution was then applied to "L-ASPARAGINE
AGAROSE", commercialized by Sigma Chemical Co., St. Louis, U.S.A., equilibrated with the same buffer. After washing
with the same buffer, 10 mM sodium phosphate buffer (pH 7.5) containing 0.5 M NaCl was fed to the column for elution.
The eluted fractions were pooled and concentrated by using a membrane. The concentrate was applied to "HILOAD
SUPERDEX 200 COLUMN", commercialized by Pharmacia LKB Biotechnology AB, Uppsala, Sweden, equilibrated
with Tris-HCl buffer (pH 8.0) containing 10 v/v % glycerol, and eluted from the column. The eluted fractions, containing
substances with a molecular weight of about 300 kDa, were collected to obtain a purified polypeptide with a purity of
90% or more in a yield of about 0.1 mg/ml-culture.


Example A-1 (c) 

Physicochemical property of polypeptide 

The purified polypeptide obtained above was analyzed to determine its physicochemical properties: The molecular
weight of the purified polypeptide in a native form was determined by gel filtration similarly as in Experiment 1-1(e).
The peak for L-asparaginase activity of the eluted fractions was found at a position corresponding to a molecular weight
of about 300 kDa. The molecular weight of the purified polypeptide in a dissociated form was determined by
SDS-PAGE as used in Experiment 1-1(e). The main band was observed at a position corresponding to a molecular weight
of about 50±10 kDa. The results indicate that the purified polypeptide exists as a multimer in its native form.
Considering errors in measurement by the above methods and the fact that all the known L-asparaginases from sources
other than mammals, such as Escherichia coli, exist in a tetrameric form, it can be estimated that the purified
polypeptide exists in a tetrameric form. The method as used in Experiment 1-1(d) confirmed that the purified
polypeptide has an L-asparaginase activity.

Example A-2(a)

Preparation of transformant

FIG.3 summarizes the procedures to prepare transformants. PCR was performed under the same conditions as
used in Example A-1(a) except for the nucleotide sequences of the sense- and anti-sense-primers. As the sense- and
anti-sense-primers, oligonucleotides with the nucleotide sequences of
5'-GTGAATTCGGAGGTTCAGATGGCGCGCGCATCA-3' and 5'-CTGCGGCCGCTCAGATGGCAGGCGGCAC-3' were respectively used.
The DNA thus amplified was cleaved by restriction enzymes Eco RI and Not I to obtain an about 1.7 kbp Eco RI-Not I
fragment. Seventy ng of the DNA fragment was mixed with 50 ng of a plasmid vector, "pBPV", commercialized by
Pharmacia LKB Biotechnology AB, Uppsala, Sweden, cleaved in advance by restriction enzymes Xho I and Not I, and 25 ng
of each of 4 oligonucleotides as linkers with nucleotide sequences of 5'-TCGAGCCACCATGAAGTGTTCGTGGGTTATT-3',
5'-TTCTTCCTGATGGCCGTAGTGACAGGAGTG-3', 5'-AATTCACTCCTGTCACTACGGCCATCAGGA-3', and
5'-AGAAAATAACCCACGAACACTTCATGGTGGC-3'. The oligonucleotides for linkers were synthesized in a usual manner
and used after being reacted with T4 polynucleotide kinase, commercialized by Pharmacia LKB Biotechnology AB,
Uppsala, Sweden, and purified by ethanol precipitation. To the DNA mixture was added the solution I in "LIGATION KIT
VERSION 2", commercialized by Takara Shuzo, Tokyo, Japan. The mixture was incubated at 16°C for 2 hours to obtain
a replicable recombinant DNA "pBlgGPA/WT".

The recombinant DNA pBlgGPA/WT was introduced into an Escherichia coli HB101 strain by the competent cell
method. The transformant thus obtained was inoculated into L broth medium (pH 7.2) containing 50 µg/ml ampicillin
followed by cultivation at 37°C for 18 hours under shaking conditions. The transformants, collected by centrifuging the
resulting culture, were subjected to a conventional alkali-SDS method to extract the recombinant DNA pBlgGPA/WT.
The nucleotide sequence analysis using an automatic sequencer confirmed that the recombinant DNA pBlgGPA/WT
had the structure in FIG.4: A DNA encoding a polypeptide containing a signal sequence for immunoglobulin secretion,
as shown by D. F. Stern et al. in "Science", Vol.235, pp.321-324 (1984), i.e., "Ig sec DNA", was ligated downstream
of a region for transcriptional regulation comprising an enhancer derived from long terminal repeats of Moloney Mouse
Sarcoma Virus (Emsv) and a promoter derived from Mouse metallothionein I gene (Pmti). Furthermore, GPA/WT DNA
was ligated in the same frame downstream of the Ig sec DNA in the direction from the 5'- to 3'-termini of GPA/WT
DNA. It was also confirmed that a termination codon exists at the 3'-terminus of GPA/WT DNA without intervening
sequences.

The recombinant DNA pBlgGPA/WT was introduced into a cell line C127 (ATCC CRL-1616), derived from a mouse,
by using a lipofectin® reagent commercialized by Life Technologies, Inc., Gaithersburg, U.S.A., according to the attached
protocol. The transformants with the recombinant DNA were selected based on the lack of proliferation-regulatory
ability, i.e., focus-forming ability, as a first selection. The cells around those containing foci were collected using sterilized
filter papers and subjected to a conventional limiting dilution method to form single cells, which were then selected
depending on the productivity of L-asparaginase, as a final selection. Thus, a transformant, "C-GPA/WT", was obtained.

Example A-2(b)

Production of polypeptide 

The transformant C-GPA/WT was inoculated into a well of "3046", a plastic multiwell plate with 6 wells, 3.5 cm in
diameter, commercialized by Becton Dickinson Labware, New Jersey, U.S.A., with DME medium containing 10 v/v %
bovine fetal serum, and cultured to confluence as a seed culture. Some of the cells, detached by treatment with trypsin,
were inoculated as seed cells into each well of the multiwell plates, which were charged with a fresh preparation of the
same medium, and cultured. After repeating similar manipulations with scale-up to increase the
cell number, the cells were subjected to a conventional continuous culture using 50 culture flasks of 150 cm2. The
resulting culture supernatant, 100 l in volume, was collected and treated by methods similar to those for treating the
supernatant from the cell-disruptants in Example A-1(b): salting out with ammonium sulfate, the chromatography of
the solution of the precipitates using Q SEPHAROSE FF COLUMN, the chromatography of the eluted fractions using
L-ASPARAGINE AGAROSE, and the chromatography of the eluted fractions using HILOAD SUPERDEX 200 COLUMN.
Consequently, a purified polypeptide with a purity of 90 % or more was obtained in a yield of about one
µg/ml-culture.


Example A-2(c) 

Physicochemical property of polypeptide

By testing similarly as in Example A-1(c), it was confirmed that the purified polypeptide thus obtained had
physicochemical properties equivalent to those of the polypeptide obtained in Example A-1(b).

Example A-3(a) 

Preparation of transformant

PCRs were performed under the same conditions as in Example A-1(a) except for the template and the sense- and
anti-sense-primers. The DNAs thus obtained were treated similarly as in Example A-1(a) to prepare recombinant DNAs,
"pKGPA/D364stp", "pKHA/MUT1", "pKHA/MUT2", "pKHA/MUT3" and "pKHA/MUT5". Table 5 summarizes the template
DNAs and the nucleotide sequences of the sense- and anti-sense-primers which were used to prepare each of the
recombinant DNAs. By sequencing similarly as in Example A-1(a), the structures of these recombinant DNAs were
confirmed as shown in FIGs.5 to 9.



[Table 5: recombinant DNA, template DNA, and nucleotide sequences of the sense and anti-sense primers]

The recombinant DNAs were treated according to the methods as in Example A-1(a) to obtain transformants,
"J-GPA/D364stp", "J-HA/MUT1", "J-HA/MUT2", "J-HA/MUT3" and "J-HA/MUT5".

Example A-3(b) 


Production of polypeptide 

The transformants obtained in Example A-3(a) were treated similarly as in Example
A-1(b): cultivation, disruption of the resulting cells, precipitation of the cell-disruptants with ammonium sulfate,
chromatography of the precipitate solutions using Q SEPHAROSE FF COLUMN, and chromatography of the eluted
fractions using L-ASPARAGINE AGAROSE, in that order. The eluted fractions thus obtained were concentrated using
membranes similarly as in Example A-1(b) and subjected to chromatography using HILOAD SUPERDEX
200 COLUMN to collect the eluted fractions with a molecular weight of about 140 kDa. Each system yielded a purified
polypeptide with a purity of 90 % or more in a yield of about 0.1 mg/ml-culture. These purified polypeptides were
analyzed by the methods as in Example A-1(c) to examine their physicochemical properties. Table 6 shows the results
combined with those in Example A-1(c).
Table 6 indicates that all of the present polypeptides, expressed in Escherichia coli and purified, exhibited
L-asparaginase activity. Furthermore, Table 6 indicates that the polypeptides formed tetramers.

Example A-4(a) 


Preparation of transformants 

PCRs were performed under the same conditions as in Example A-1(a) except for the template and the sense- and
anti-sense-primers. DNAs thus obtained were ligated with the same linkers as used in Example A-2(a) under the same
conditions as in Example A-2(a) to obtain recombinant DNAs, "pBlgGPA/D364stp", "pBlgHA/MUT1", "pBlgHA/MUT2",
"pBlgHA/MUT3" and "pBlgHA/MUT5". Table 7 summarizes the template DNAs and the nucleotide sequences of the sense- and
anti-sense-primers which were used to prepare each of the recombinant DNAs. By sequencing similarly as in Example
A-1(a), the structures of these recombinant DNAs were confirmed as shown in FIGs.10 to 14.
The recombinant DNAs thus obtained were treated similarly as in Example A-2(a) to obtain transformants,
"C-GPA/D364stp", "C-HA/MUT1", "C-HA/MUT2", "C-HA/MUT3" and "C-HA/MUT5".

Example A-4(b) 


Production of polypeptide 

The transformants obtained in Example A-4(a) were cultured according to the methods as in Example A-2(b), and
the resulting culture supernatants were treated by methods similar to those for treating the supernatants from the
cell-disruptants in Example A-1(b): precipitation of the culture supernatants with ammonium sulfate, chromatography of
the precipitate solutions using Q SEPHAROSE FF COLUMN, and chromatography of the eluted fractions using
L-ASPARAGINE AGAROSE, in that order. The eluted fractions thus obtained were concentrated using membranes
similarly as in Example A-1(b) and subjected to chromatography using HILOAD SUPERDEX 200 COLUMN to
collect the eluted fractions with a molecular weight of about 140 kDa. Each of these systems yielded a purified
polypeptide with a purity of 90 % or more in a yield of about one µg/ml-culture. These purified polypeptides were
analyzed by the methods as in Example A-1(c) to examine their physicochemical properties. Table 8 shows the results
combined with those in Example A-3.
Table 8 indicates that all of the present polypeptides, expressed in mammalian cells and purified, exhibited
L-asparaginase activity. Furthermore, Table 8 indicates that the polypeptides formed tetramers.

As shown in Example A above, each of the polypeptides according to the present invention expresses an
L-asparaginase activity. Therefore, the present agent for susceptive diseases hydrolyzes L-asparagine in patients to exert
therapeutic and preventive effects on L-asparaginase-susceptive diseases when administered to humans. The wording
"susceptive diseases" as referred to in the present specification means diseases in general which are caused by the
existence of tumor cells dependent on L-asparagine: for example, leukemias such as acute leukemia, an acute
transformation of chronic leukemia and T-lymphocytic leukemia, and malignant tumors such as Hodgkin's disease and
non-Hodgkin's disease. The present agent for susceptive diseases thus has uses as an anti-tumor agent for
treating and/or preventing such susceptive diseases. Although it varies depending on the types of agents
used for such purposes and the susceptive diseases to be treated, the present agent is generally processed into an agent
in the form of a liquid, a paste or a solid which contains the polypeptides in an amount of 0.000001-100 w/w %,
preferably, 0.0001-100 w/w %, on a dry solid basis.

The present agent can be used intact or processed into compositions by mixing with one or more selected from
the group consisting of physiologically-acceptable carriers, excipients, solvents, buffers and stabilizers, and further, if
necessary, other biologically-active substances and other agents. For example, "Iyakuhin-Tenkabutsu-Jiten (The
Dictionary of Pharmaceutical Excipients)" (1994), edited by Japan Pharmaceutical Excipients Council, Tokyo, Japan,
published by Yakujinippo LTD., Tokyo, Japan, and "Iyakuhin-Tenkabutsu-Jiten-Tsuiho 1995 (Supplement for The Dictionary
of Pharmaceutical Excipients)" (1995), edited by Japan Pharmaceutical Excipients Council, Tokyo, Japan, published
by Yakujinippo LTD., Tokyo, Japan, mention the embodiments of such carriers, excipients, solvents, buffers and
stabilizers. Examples of such other biologically-active substances and other agents include interferon-α, interferon-β,
interferon-γ, interleukin 1, interleukin 2, interleukin 3, TNF-α, TNF-β, GM-CSF, carboquone, cyclophosphamide,
aclarubicin, thiotepa, busulfan, ancitabine, cytarabine, fluorouracil, 5-fluoro-1-(tetrahydro-2-furyl)uracil, methotrexate,
actinomycin D, chromomycin A3, daunorubicin, doxorubicin, bleomycin, mercaptopurine, prednisolone, mitomycin C,
vincristine, vinblastine, radio gold colloidal, Krestin®, picibanil, lentinan and Maruyama vaccine.

The present agent for susceptive diseases includes those in a unit dose form, which means a physically separated
and formed medicament suitable for administration that contains the polypeptides in a daily dose or in a dose from
1/40 to several-fold (up to 4-fold) of the daily dose. Examples of such medicaments are injections, liquids, powders,
granules, tablets, capsules, sublinguals, ophthalmic solutions, nasal drops and suppositories.

The present agent can be administered to patients orally or parenterally. In both administrations, the agent exerts
a satisfactory effect in the treatment and/or the prevention of the susceptive diseases. Although it varies depending
on the types of susceptive diseases and their symptoms, the agent can be orally administered to patients or parenterally
administered to patients' intradermal tissues, subcutaneous tissues, muscles, and veins at a dose, in terms of the amount
of the polypeptides, in the range of about 0.1 µg-500 mg/shot, preferably, about 0.1-100 mg/shot, 1-4 times/day or 1-7
times/week, for one day to one year. The present agent for susceptive diseases further includes forms for gene
therapy. When transformants into which the DNAs encoding the polypeptides of this invention have been introduced are
administered to patients so that the DNAs are expressed in them, they exert effects equivalent to those of the above
administrations. For example, "Jikken-Igaku Bessatsu, Bio-manual Up Series, Idenshi-Chiryo-No-Kisogijutsu (Basic
Techniques for Gene Therapy)" (1996), edited by Takashi SHIMADA, Izumi SAITO and Takaya OZAWA, published by
Yodosha, Tokyo, Japan, details the general procedures for gene therapy.

The biological activities and acute toxicity of the present polypeptides are explained based on Experiments 3 and
4 below, respectively.

Experiment 3

Biological activity

Experiment 3-1

Antitumor effect in vitro

A human histiocytic lymphoma cell line U937 (ATCC CRL-1593), and a cell line Molt4 (ATCC CRL-1582), derived
from human T lymphoblasts, were subcultured in RPMI 1640 medium containing 10 v/v % fetal bovine serum. The
cells collected by centrifugation from each subculture in logarithmic phase were suspended in the same medium to
give a concentration of 2 x 10^5 cells/ml. One ml of each cell suspension was placed in each of 13 wells of 24-well
multiwell plates, "3047", commercialized by Becton Dickinson Labware, New Jersey, U.S.A. Dilutions in PBS of each
of the 12 types of purified polypeptides prepared in Examples A-1 to A-4 were then added to the respective wells,
and the cells were cultured at 37°C for 72 hours in a 5 v/v % CO2 incubator. The final concentration of each of the
purified polypeptides was one unit/ml in terms of L-asparaginase activity. As a control, cells to which an
equivalent volume of PBS had been added were cultured in the same manner. After the cultivation, the cells were
collected and dead cells were stained with trypan blue. The cell survival ratio in each system with the purified
polypeptides was compared with that in the control. All of the cell survival ratios with the purified polypeptides
were significantly lower than that in the control. These results indicate that all of the present polypeptides,
obtained in Examples A-1 to A-4, have cytotoxicity to U937 and Molt4.
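
The comparison of survival ratios in Experiment 3-1 can be sketched as follows. This is only an illustrative calculation: the function name `survival_ratio` and all cell counts are hypothetical and are not data from the experiment. It assumes that the trypan blue counts are available as numbers of unstained (live) and stained (dead) cells per well.

```python
def survival_ratio(live: int, dead: int) -> float:
    """Fraction of counted cells that exclude trypan blue (unstained = live)."""
    total = live + dead
    if total == 0:
        raise ValueError("no cells counted")
    return live / total


# Hypothetical counts for illustration only; these are not measured values.
control = survival_ratio(live=180, dead=20)   # well that received PBS
treated = survival_ratio(live=45, dead=105)   # well that received a polypeptide

# Survival in the treated well relative to the PBS control.
relative = treated / control
print(f"control {control:.2f}, treated {treated:.2f}, relative {relative:.2f}")
```

A relative value below 1 corresponds to a lower survival ratio than in the control; whether a difference is significant would have to be judged with replicate wells and a statistical test, which this sketch does not attempt.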

Experiment 3-2

Antitumor effect in vivo

The model mice were C3H mice in which a mouse lymphoma cell line 6C3HED, registered in Cell Resource
Center for Biomedical Research, Institute of Development, Aging and Cancer, Tohoku University, Sendai, Japan, had
been transplanted and passaged by subcutaneous injection into their sides at a dose of 1 x 10^7 cells/body every
8 days in the usual manner. The purified polypeptides obtained in Examples A-1 to A-4 were administered to the model
mice at a dose of 400 units/body by venoclysis every day from the fourth to the seventh day after the
transplantation of the cells. The sizes of the tumors were observed with the naked eye on the fourth and eighth
days after the transplantation. The purified polypeptides were administered after being diluted with 0.15 M NaCl
and filtered through membrane filters, 0.45 µm in pore size, commercialized by Millipore Corp., Bedford, U.S.A. As
a control, 0.15 M NaCl was treated and administered in the same manner. While significant enlargement of the
tumors was observed in the control, significant involution or disappearance of the tumors was observed in the mice
administered with the polypeptides. These results indicate that all of the present polypeptides, obtained in
Examples A-1 to A-4, are able to cure the tumors of the model mice.

Experiment 4

Acute toxicity

The purified polypeptides obtained in Examples A-1 to A-4 were separately administered to 8-week-old mice
percutaneously, perorally or intraperitoneally in a conventional manner. The LD50 of all the polypeptides was
about 100 mg/kg or higher, independently of the administration route. These results evidence that the present
polypeptides can be safely incorporated into pharmaceuticals for administration to humans.

The following examples explain the present agent for susceptive diseases. 

Example B-1

Solution

The purified polypeptides obtained in Examples A-1 to A-4 were separately dissolved to give a concentration of
0.1 mg/ml in physiological saline containing one w/v % human serum albumin as a stabilizer, and sterilized with
membrane filters in a conventional manner to obtain solutions.

All of the products have satisfactory stabilities and can be used as injections, ophthalmic solutions and
collunaria in the treatment and/or the prevention of susceptive diseases including a malignant tumor, acute
leukemia, malignant lymphoma, an acute transformation of chronic leukemia and T-lymphocytic leukemia.
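
As a worked illustration of the concentrations stated in Example B-1, the amounts needed for a batch can be computed as sketched below. The helper `solution_batch` and the 500 ml batch volume are hypothetical and chosen only for illustration; the sketch assumes the usual meaning of w/v %, i.e. grams of solute per 100 ml of solution.

```python
def solution_batch(volume_ml: float,
                   polypeptide_mg_per_ml: float = 0.1,
                   stabilizer_w_v_percent: float = 1.0) -> dict:
    """Amounts required for a batch of the given volume.

    w/v % is taken as grams of solute per 100 ml of solution.
    """
    if volume_ml <= 0:
        raise ValueError("volume must be positive")
    return {
        "polypeptide_mg": polypeptide_mg_per_ml * volume_ml,
        "stabilizer_g": stabilizer_w_v_percent / 100.0 * volume_ml,
    }


# Hypothetical batch volume; the example itself does not state one.
batch = solution_batch(500.0)
print(batch)
```

With the default values the same helper covers any solution that is 0.1 mg/ml in polypeptide with one w/v % of stabilizer; other concentrations can be passed explicitly.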

Example B-2

Solution

The purified polypeptides obtained in Examples A-1 to A-4 were separately dissolved to give a concentration of
0.1 mg/ml in physiological saline containing one w/v % glycerol as a stabilizer, and sterilized with membrane filters
in a conventional manner to obtain solutions.

All of the products have satisfactory stabilities and can be used as injections, ophthalmic solutions and
collunaria for the treatment and/or the prevention of susceptive diseases including a malignant tumor, acute
leukemia, malignant lymphoma, an acute transformation of chronic leukemia and T-lymphocytic leukemia.

Example B-3

Dry injection

The purified polypeptides obtained in Examples A-1 to A-4 were separately dissolved to give a concentration of
50 mg/ml in physiological saline containing one w/v % purified gelatin as a stabilizer, and the solutions were
sterilized with membrane filters in a conventional manner. One ml aliquots of the sterilized solutions were
distributed to vials, lyophilized and cap sealed.

All of the products have satisfactory stabilities and can be used as dry injections for the treatment and/or the
prevention of susceptive diseases including a malignant tumor, acute leukemia, malignant lymphoma, an acute
transformation of chronic leukemia and T-lymphocytic leukemia.

Example B-4

Ointment

"HI-BIS-WAKO 104", a carboxyvinyl polymer commercialized by Wako Pure Chemicals, Tokyo, Japan, and a
purified trehalose were dissolved in sterilized distilled water to give concentrations of 1.4 w/w % and 2.0 w/w %,
respectively, and the purified polypeptides obtained in Examples A-1 to A-4 were separately mixed to homogeneity in
the solutions, followed by adjusting the pH of the resulting solutions to pH 7.2 to obtain pastes containing about
one mg/g of the polypeptides.

All of the products have satisfactory spreadabilities and stabilities, and can be used as ointments for treating
and/or preventing susceptive diseases including a malignant tumor, acute leukemia, malignant lymphoma, an acute
transformation of chronic leukemia and T-lymphocytic leukemia.

Example B-5

Tablet

Any one of the purified polypeptides obtained in Examples A-1 to A-4 and LUMIN, i.e.
[bis-4-(1-ethylquinoline)][γ-4'-(1-ethylquinoline)] pentamethionine cyanine, as a cell activator were mixed to
homogeneity with "FINETOSE®", an anhydrous crystalline α-maltose commercialized by Hayashibara Co., Ltd., Okayama,
Japan, and the mixtures were tabletted by a tabletting machine to obtain tablets, about 200 mg in weight each,
containing the polypeptide and the LUMIN, about 5 mg each.

All of the products have satisfactory swallowing abilities, stabilities and cell activating activities, and can be
used for treating and/or preventing susceptive diseases including a malignant tumor, acute leukemia, malignant
lymphoma, an acute transformation of chronic leukemia and T-lymphocytic leukemia.

The present invention is based on the finding of polypeptides which originate from mammal, having L-asparaginase
activity. The polypeptides are substances whose amino acid sequences have been completely revealed and which have
stable activities to hydrolyze L-asparagine. Therefore, the polypeptides exert satisfactory effects in the treatment
and/or the prevention of diseases caused by tumor cells dependent on L-asparagine.

The polypeptides originate from mammal, so that they have low antigenicities to humans and do not cause serious
side effects even when administered in large amounts or continuously. Therefore, the polypeptides have the advantage
that they can exert desired effects without restrictive control of patients' sensitivities.

The polypeptides, which are thus valuable, can be produced in desired amounts using the present DNAs encoding them.

Thus, the present invention is a significant invention which has a remarkable effect and makes a great contribution
to this field.

While there has been described what is at present considered to be the preferred embodiments of the present
invention, it will be understood that various modifications may be made therein, and it is intended to cover in the
appended claims all such modifications as fall within the true spirit and scope of the invention.
SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT:

NAME: KABUSHIKI KAISHA HAYASHIBARA SEIBUTSU KAGAKU KENKYUJO

(ii) TITLE OF INVENTION: POLYPEPTIDES HAVING L-ASPARAGINASE ACTIVITY

(iii) NUMBER OF SEQUENCES: 18

(iv) ADDRESS:

(A) ADDRESSEE: KABUSHIKI KAISHA HAYASHIBARA SEIBUTSU KAGAKU KENKYUJO
(B) STREET: 2-3, 1-CHOME, SHIMOISHII
(C) CITY: OKAYAMA
(E) COUNTRY: JAPAN
(F) POSTAL CODE (ZIP): 700

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: WordPerfect Version 5.1

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: JP 168,172/96
(B) FILING DATE: June 7, 1996

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Thr Gly Gly Thr
1

(3) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULAR TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

His Gly Thr Asp Thr
1               5



(4) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULAR TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Gln Cys Leu Xaa Gly
1               5



(5) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 363 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULAR TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Ala Arg Ala Ser Gly Ser Glu Arg His Leu Leu Leu Ile Tyr Thr
1               5                   10                  15

Gly Gly Thr Leu Gly Met Gln Ser Lys Gly Gly Val Leu Val Pro Gly
            20                  25                  30

Pro Gly Leu Val Thr Leu Leu Arg Thr Leu Pro Met Phe His Asp Lys
        35                  40                  45

Glu Phe Ala Gln Ala Gln Gly Leu Pro Asp His Ala Leu Ala Leu Pro
    50                  55                  60

Pro Ala Ser His Gly Pro Arg Val Leu Tyr Thr Val Leu Glu Cys Gln
65                  70                  75                  80

Pro Leu Leu Asp Ser Ser Asp Met Thr Ile Asp Asp Trp Ile Arg Ile
                85                  90                  95

Ala Lys Ile Ile Glu Arg His Tyr Glu Gln Tyr Gln Gly Phe Val Val
            100                 105                 110

Ile His Gly Thr Asp Thr Met Ala Phe Gly Ala Ser Met Leu Ser Phe
        115                 120                 125

Met Leu Glu Asn Leu His Lys Pro Val Ile Leu Thr Gly Ala Gln Val
    130                 135                 140

Pro Ile Arg Val Leu Trp Asn Asp Ala Arg Glu Asn Leu Leu Gly Ala
145                 150                 155                 160

Leu Leu Val Ala Gly Gln Tyr Ile Ile Pro Glu Val Cys Leu Phe Met
                165                 170                 175

Asn Ser Gln Leu Phe Arg Gly Asn Arg Val Thr Lys Val Asp Ser Gln
            180                 185                 190

Lys Phe Glu Ala Phe Cys Ser Pro Asn Leu Ser Pro Leu Ala Thr Val
        195                 200                 205

Gly Ala Asp Val Thr Ile Ala Trp Asp Leu Val Arg Lys Val Asn Trp
    210                 215                 220

Lys Asp Pro Leu Val Val His Ser Asn Met Glu His Asp Val Ala Leu
225                 230                 235                 240

Leu Arg Leu Tyr Pro Gly Ile Pro Ala Ser Leu Val Arg Ala Phe Leu
                245                 250                 255

Gln Pro Pro Leu Lys Gly Val Val Leu Glu Thr Phe Gly Ser Gly Asn
            260                 265                 270

Gly Pro Ser Lys Pro Asp Leu Leu Gln Glu Leu Arg Ala Ala Ala Gln
        275                 280                 285

Arg Gly Leu Ile Met Val Asn Cys Ser Gln Cys Leu Arg Gly Ser Val
    290                 295                 300

Thr Pro Gly Tyr Ala Thr Ser Leu Ala Gly Ala Asn Ile Val Ser Gly
305                 310                 315                 320

Leu Asp Met Thr Ser Glu Ala Ala Leu Ala Lys Leu Ser Tyr Val Leu
                325                 330                 335

Gly Leu Pro Glu Leu Ser Leu Glu Arg Arg Gln Glu Leu Leu Ala Lys
            340                 345                 350

Asp Leu Arg Gly Glu Met Thr Leu Pro Thr Ala
        355                 360         363
(6) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 565 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULAR TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
(7) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 365 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULAR TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Ala Arg Ala Val Gly Pro Glu Arg Arg Leu Leu Ala Val Tyr Thr
1               5                   10                  15

Gly Gly Thr Ile Gly Met Arg Ser Glu Leu Gly Val Leu Val Pro Gly
            20                  25                  30

Thr Gly Leu Ala Ala Ile Leu Arg Thr Leu Pro Met Phe His Asp Glu
        35                  40                  45

Glu His Ala Arg Ala Arg Gly Leu Ser Glu Asp Thr Leu Val Leu Pro
    50                  55                  60

Pro Asp Ser Arg Asn Gln Arg Ile Leu Tyr Thr Val Leu Glu Cys Gln
65                  70                  75                  80

Pro Leu Phe Asp Ser Ser Asp Met Thr Ile Ala Glu Trp Val Arg Val
                85                  90                  95

Ala Gln Thr Ile Lys Arg His Tyr Glu Gln Tyr His Gly Phe Val Val
            100                 105                 110

Ile His Gly Thr Asp Thr Met Ala Phe Ala Ala Ser Met Leu Ser Phe
        115                 120                 125

Met Leu Glu Asn Leu Gln Lys Thr Val Ile Leu Thr Gly Ala Gln Val
    130                 135                 140

Pro Ile His Ala Leu Trp Ser Asp Gly Arg Glu Asn Leu Leu Gly Ala
145                 150                 155                 160

Leu Leu Met Ala Gly Gln Tyr Val Ile Pro Glu Val Cys Leu Phe Phe
                165                 170                 175

Gln Asn Gln Leu Phe Arg Gly Asn Arg Ala Thr Lys Val Asp Ala Arg
            180                 185                 190

Arg Phe Ala Ala Phe Cys Ser Pro Asn Leu Leu Pro Leu Ala Thr Val
        195                 200                 205

Gly Ala Asp Ile Thr Ile Asn Arg Glu Leu Val Arg Lys Val Asp Gly
    210                 215                 220

Lys Ala Gly Leu Val Val His Ser Ser Met Glu Gln Asp Val Gly Leu
225                 230                 235                 240

Leu Arg Leu Tyr Pro Gly Ile Pro Ala Ala Leu Val Arg Ala Phe Leu
                245                 250                 255

Gln Pro Pro Leu Lys Gly Val Val Met Glu Thr Phe Gly Ser Gly Asn
            260                 265                 270

Gly Pro Thr Lys Pro Asp Leu Leu Gln Glu Leu Arg Val Ala Thr Glu
        275                 280                 285

Arg Gly Leu Val Ile Val Asn Cys Thr Gln Cys Leu Arg Gly Ala Val
    290                 295                 300

Thr Thr Asp Tyr Ala Ala Gly Met Ala Met Ala Gly Ala Asn Val Ile
305                 310                 315                 320

Ser Gly Phe Asp Met Thr Ser Glu Ala Ala Leu Ala Lys Leu Ser Tyr
                325                 330                 335

Val Leu Gly Gln Pro Gly Leu Ser Leu Asp Val Arg Lys Glu Leu Leu
            340                 345                 350

Thr Lys Asp Leu Arg Gly Glu Met Thr Pro Pro Ser Val
        355                 360                 365



(8) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 365 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULAR TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Met Ala Arg Ala Val Gly Pro Glu Arg Arg Leu Leu Ala Val Tyr Thr
1               5                   10                  15

Gly Gly Thr Ile Gly Met Arg Ser Glu Leu Gly Val Leu Val Pro Gly
            20                  25                  30

Thr Gly Leu Ala Ala Ile Leu Arg Thr Leu Pro Met Phe His Asp Glu
(9) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 365 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULAR TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Ala Arg Ala Val Gly Pro Glu Arg Arg Leu Leu Ala Val Tyr Thr
1               5                   10                  15

Gly Gly Thr Ile Gly Met Arg Ser Glu Leu Gly Val Leu Val Pro Gly
            20                  25                  30

Thr Gly Leu Ala Ala Ile Leu Arg Thr Leu Pro Met Phe His Asp Glu
(10) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 365 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULAR TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Lys Ala Gly Leu Val Val His Ser Ser Met Glu Gln Asp Val Gly Leu
225                 230                 235                 240

Leu Arg Leu Tyr Pro Gly Ile Pro Ala Ala Leu Val Arg Ala Phe Leu
                245                 250                 255

Gln Pro Pro Leu Lys Gly Val Val Met Glu Thr Phe Gly Ser Gly Asn
            260                 265                 270

Gly Pro Thr Lys Pro Asp Leu Leu Gln Glu Leu Arg Val Ala Thr Glu
        275                 280                 285

Arg Gly Leu Val Ile Val Asn Cys Thr Gln Cys Leu Gln Gly Ala Val
    290                 295                 300

Thr Thr Asp Tyr Ala Ala Gly Met Ala Met Ala Gly Ala Gly Val Ile
305                 310                 315                 320

Ser Gly Phe Asp Met Thr Ser Glu Ala Ala Leu Ala Lys Leu Ser Tyr
                325                 330                 335

Val Leu Gly Gln Pro Gly Leu Ser Leu Asp Val Arg Lys Glu Leu Leu
            340                 345                 350

Thr Lys Asp Leu Arg Gly Glu Met Thr Pro Pro Ser Val
        355                 360                 365
(11) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1089 base pairs
(B) TYPE: nucleic acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

ATGGCGCGCG CATCAGGCTC CGAGAGGCAC CTGCTGCTCA TCTACACTGG CGGCACTTTG 60
GGCATGCAGA GCAAGGGCGG GGTGCTCGTC CCCGGCCCAG GCCTGGTCAC TCTGCTGCGG 120
ACCCTGCCCA TGTTCCATGA CAAGGAGTTC GCCCAGGCCC AGGGCCTCCC TGACCATGCT 180
CTGGCGCTGC CCCCTGCCAG CCACGGCCCC AGGGTCCTCT ACACGGTGCT GGAGTGCCAG 240
CCCCTCTTGG ATTCCAGCGA CATGACCATC GATGATTGGA TTCGCATAGC CAAGATCATA 300
GAGAGGCACT ATGAGCAGTA CCAAGGCTTT GTGGTTATCC ACGGCACCGA CACCATGGCC 360
TTTGGGGCCT CCATGCTGTC CTTCATGCTG GAAAACCTGC ACAAACCAGT CATCCTCACT 420
GGCGCCCAGG TGCCAATCCG TGTGCTGTGG AATGACGCCC GGGAAAACCT GCTGGGGGCG 480
TTGCTTGTGG CCGGCCAATA CATCATCCCT GAGGTCTGCC TGTTTATGAA CAGTCAGCTG 540
TTTCGGGGAA ACCGGGTAAC CAAGGTGGAC TCCCAGAAGT TTGAGGCCTT CTGCTCCCCC 600
AATCTGTCCC CACTAGCCAC TGTGGGCGCG GATGTCACAA TTGCCTGGGA CCTGGTGCGC 660
AAGGTCAACT GGAAGGACCC GCTGGTGGTG CACAGCAACA TGGAGCACGA CGTGGCACTG 720
CTGCGCCTCT ACCCTGGCAT CCCGGCCTCC CTGGTCCGGG CATTCCTGCA GCCCCCGCTC 780
AAGGGCGTGG TCCTGGAGAC CTTCGGCTCT GGCAACGGGC CGAGCAAGCC CGACCTGCTG 840
CAGGAGTTGC GGGCCGCGGC CCAGCGCGGC CTCATCATGG TCAACTGCAG CCAGTGCCTG 900
CGGGGGTCTG TGACCCCGGG CTATGCCACG AGCTTGGCGG GCGCCAACAT CGTGTCCGGC 960
TTAGACATGA CCTCAGAGGC CGCGCTGGCT AAGCTGTCCT ACGTGTTGGG CCTGCCGGAG 1020
CTGAGCCTGG AGCGCAGGCA GGAGCTGCTG GCCAAGGATC TTCGCGGGGA AATGACACTG 1080
CCCACGGCA 1089



(12) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1095 base pairs
(B) TYPE: nucleic acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

ATGGCGCGCG CGGTGGGGCC CGAGCGGAGG CTGCTGGCCG TCTACACCGG CGGCACCATT 60
GGCATGCGGA GTGAGCTCGG CGTGCTTGTG CCCGGGACGG GCCTGGCTGC CATCCTGAGG 120
ACACTGCCCA TGTTCCATGA CGAGGAGCAC GCCCGAGCCC GCGGCCTCTC TGAGGACACC 180
CTGGTGCTAC CCCCGGACAG CCGCAACCAG AGGATCCTCT ACACCGTGCT GGAGTGCCAG 240
CCCCTCTTCG ACTCCAGTGA CATGACCATC GCTGAGTGGG TTCGCGTTGC CCAGACCATC 300
AAGAGGCACT ACGAGCAGTA CCACGGCTTT GTGGTCATCC ACGGCACCGA CACCATGGCC 360
TTTGCTGCCT CGATGCTGTC CTTCATGCTG GAGAACCTGC AGAAGACTGT CATCCTCACT 420
GGGGCCCAGG TGCCCATCCA TGCCCTGTGG AGCGACGGCC GTGAGAACCT GCTGGGGGCA 480
CTGCTCATGG CTGGCCAGTA TGTGATCCCA GAGGTCTGCC TTTTCTTCCA GAATCAGCTG 540
TTTCGGGGCA ACCGGGCAAC CAAGGTAGAC GCTCGGAGGT TCGCAGCTTT CTGCTCCCCG 600
AACCTGCTGC CTCTGGCCAC AGTGGGTGCT GACATCACAA TCAACAGGGA GCTGGTGCGG 660
AAGGTGGACG GGAAGGCTGG GCTGGTGGTG CACAGCAGCA TGGAGCAGGA CGTGGGCCTG 720
CTGCGCCTCT ACCCTGGGAT CCCTGCCGCC CTGGTTCGGG CCTTCTTGCA GCCTCCCCTG 780
AAGGGCGTGG TCATGGAGAC CTTCGGTTCA GGGAACGGAC CCACCAAGCC CGACCTGCTG 840
CAGGAGCTGC GGGTGGCCAC CGAGCGCGGC CTGGTCATCG TCAACTGTAC CCAGTGCCTC 900
CGGGGGGCTG TGACCACAGA CTATGCAGCT GGCATGGCCA TGGCGGGAGC CAACGTCATC 960
TCAGGCTTCG ACATGACATC GGAGGCCGCC CTGGCCAAGC TATCGTATGT GCTGGGCCAG 1020
CCAGGGCTGA GCCTGGATGT CAGGAAGGAG CTGCTGACCA AGGACCTTCG GGGGGAGATG 1080
ACGCCACCCT CGGTG 1095
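
The relation between the nucleotide sequence of SEQ ID NO: 11 and the amino acid sequence of SEQ ID NO: 6 can be checked with a short translation sketch. The code below is an illustration only: the function name `translate` is hypothetical, the standard genetic code is assumed, and only the first 60 bases of SEQ ID NO: 11 are used.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence into one-letter amino acids ('*' = stop)."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 60 bases of SEQ ID NO: 11, as listed above.
first_row = "ATGGCGCGCG CGGTGGGGCC CGAGCGGAGG CTGCTGGCCG TCTACACCGG CGGCACCATT"
peptide = translate(first_row)
print(peptide)

# 1095 bases correspond to 365 codons, and 1089 bases to 363 codons.
print(1095 // 3, 1089 // 3)
```

The 20 residues obtained correspond to residues 1 to 20 of SEQ ID NO: 6 (Met Ala Arg Ala Val Gly Pro Glu Arg Arg Leu Leu Ala Val Tyr Thr Gly Gly Thr Ile) written in one-letter code.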



(13) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1095 base pairs
(B) TYPE: nucleic acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

ATGGCGCGCG CGGTGGGGCC CGAGCGGAGG CTGCTGGCCG TCTACACCGG CGGCACCATT 60
GGCATGCGGA GTGAGCTCGG CGTGCTTGTG CCCGGGACGG GCCTGGCTGC CATCCTGAGG 120
ACACTGCCCA TGTTCCATGA CGAGGAGCAC GCCCGAGCCC GCGGCCTCTC TGAGGACACC 180
CTGGTGCTAC CCCCGGACAG CCGCAACCAG AGGATCCTCT ACACCGTGCT GGAGTGCCAG 240
CCCCTCTTCG ACTCCAGTGA CATGACCATC GCTGAGTGGG TTCGCGTTGC CCAGACCATC 300
AAGAGGCACT ACGAGCAGTA CCACGGCTTT GTGGTCATCC ACGGCACCGA CACCATGGCC 360
TTTGCTGCCT CGATGCTGTC CTTCATGCTG GAGAACCTGC AGAAGACTGT CATCCTCACT 420
GGGGCCCAGG TGCCCATCCA TGCCCTGTGG AGCGACGGCC GTGAGAACCT GCTGGGGGCA 480
CTGCTCATGG CTGGCCAGTA TGTGATCCCA GAGGTCTGCC TTTTCTTCCA GAATCAGCTG 540
TTTCGGGGCA ACCGGGCAAC CAAGGTAGAC GCTCGGAGGT TCGCAGCTTT CTGCTCCCCG 600
AACCTGCTGC CTCTGGCCAC AGTGGGTGCT GACATCACAA TCAACAGGGA GCTGGTGCGG 660
AAGGTGGACG GGAAGGCTGG GCTGGTGGTG CACAGCAGCA TGGAGCAGGA CGTGGGCCTG 720
CTGCGCCTCT ACCCTGGGAT CCCTGCCGCC CTGGTTCGGG CCTTCTTGCA GCCTCCCCTG 780
AAGGGCGTGG TCATGGAGAC CTTCGGTTCA GGGAACGGAC CCACCAAGCC CGACCTGCTG 840
CAGGAGCTGC GGGTGGCCAC CGAGCGCGGC CTGGTCATCG TCAACTGTAC CCAGTGCCTC 900
CGGGGGGCTG TGACCACAGA CTATGCAGCT GGCATGGCCA TGGCGGGAGC CGGCGTCATC 960
TCAGGCTTCG ACATGACATC GGAGGCCGCC CTGGCCAAGC TATCGTATGT GCTGGGCCAG 1020
CCAGGGCTGA GCCTGGATGT CAGGAAGGAG CTGCTGACCA AGGACCTTCG GGGGGAGATG 1080
ACGCCACCCT CGGTG 1095



(14) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1095 base pairs
(B) TYPE: nucleic acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

ATGGCGCGCG CGGTGGGGCC CGAGCGGAGG CTGCTGGCCG TCTACACCGG CGGCACCATT 60
GGCATGCGGA GTGAGCTCGG CGTGCTTGTG CCCGGGACGG GCCTGGCTGC CATCCTGAGG 120
ACACTGCCCA TGTTCCATGA CGAGGAGCAC GCCCGAGCCC GCGGCCTCTC TGAGGACACC 180
CTGGTGCTAC CCCCGGACAG CCGCAACCAG AGGATCCTCT ACACCGTGCT GGAGTGCCAG 240
CCCCTCTTCG ACTCCAGTGA CATGACCATC GCTGAGTGGG TTCGCGTTGC CCAGACCATC 300
AAGAGGCACT ACGAGCAGTA CCACGGCTTT GTGGTCATCC ACGGCACCGA CACCATGGCC 360
TTTGCTGCCT CGATGCTGTC CTTCATGCTG GAGAACCTGC AGAAGACTGT CATCCTCACT 420
GGGGCCCAGG TGCCCATCCA TGCCCTGTGG AGCGACGGCC GTGAGAACCT GCTGGGGGCA 480
CTGCTCATGG CTGGCCAGTA TGTGATCCCA GAGGTCTGCC TTTTCTTCCA GAATCAGCTG 540
TTTCGGGGCA ACCGGGCAAC CAAGGTAGAC GCTCGGAGGT TCGCAGCTTT CTGCTCCCCG 600
AACCTGCTGC CTCTGGCCAC AGTGGGTGCT GACATCACAA TCAACAGGGA GCTGGTGCGG 660
AAGGTGGACG GGAAGGCTGG GCTGGTGGTG CACAGCAGCA TGGAGCAGGA CGTGGGCCTG 720
CTGCGCCTCT ACCCTGGGAT CCCTGCCGCC CTGGTTCGGG CCTTCTTGCA GCCTCCCCTG 780
AAGGGCGTGG TCATGGAGAC CTTCGGTTCA GGGAACGGAC CCACCAAGCC CGACCTGCTG 840
CAGGAGCTGC GGGTGGCCAC CGAGCGCGGC CTGGTCATCG TCAACTGTAC CCAGTGCCTC 900
CAGGGGGCTG TGACCACAGA CTATGCAGCT GGCATGGCCA TGGCGGGAGC CAACGTCATC 960
TCAGGCTTCG ACATGACATC GGAGGCCGCC CTGGCCAAGC TATCGTATGT GCTGGGCCAG 1020
CCAGGGCTGA GCCTGGATGT CAGGAAGGAG CTGCTGACCA AGGACCTTCG GGGGGAGATG 1080
ACGCCACCCT CGGTG 1095



(15) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1095 base pairs
(B) TYPE: nucleic acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

ATGGCGCGCG CGGTGGGGCC CGAGCGGAGG CTGCTGGCCG TCTACACCGG CGGCACCATT 60
GGCATGCGGA GTGAGCTCGG CGTGCTTGTG CCCGGGACGG GCCTGGCTGC CATCCTGAGG 120
ACACTGCCCA TGTTCCATGA CGAGGAGCAC GCCCGAGCCC GCGGCCTCTC TGAGGACACC 180
CTGGTGCTAC CCCCGGACAG CCGCAACCAG AGGATCCTCT ACACCGTGCT GGAGTGCCAG 240
CCCCTCTTCG ACTCCAGTGA CATGACCATC GCTGAGTGGG TTCGCGTTGC CCAGACCATC 300
AAGAGGCACT ACGAGCAGTA CCACGGCTTT GTGGTCATCC ACGGCACCGA CACCATGGCC 360
TTTGCTGCCT CGATGCTGTC CTTCATGCTG GAGAACCTGC AGAAGACTGT CATCCTCACT 420
GGGGCCCAGG TGCCCATCCA TGCCCTGTGG AGCGACGGCC GTGAGAACCT GCTGGGGGCA 480
CTGCTCATGG CTGGCCAGTA TGTGATCCCA GAGGTCTGCC TTTTCTTCCA GAATCAGCTG 540
TTTCGGGGCA ACCGGGCAAC CAAGGTAGAC GCTCGGAGGT TCGCAGCTTT CTGCTCCCCG 600
AACCTGCTGC CTCTGGCCAC AGTGGGTGCT GACATCACAA TCAACAGGGA GCTGGTGCGG 660
AAGGTGGACG GGAAGGCTGG GCTGGTGGTG CACAGCAGCA TGGAGCAGGA CGTGGGCCTG 720
CTGCGCCTCT ACCCTGGGAT CCCTGCCGCC CTGGTTCGGG CCTTCTTGCA GCCTCCCCTG 780
AAGGGCGTGG TCATGGAGAC CTTCGGTTCA GGGAACGGAC CCACCAAGCC CGACCTGCTG 840
CAGGAGCTGC GGGTGGCCAC CGAGCGCGGC CTGGTCATCG TCAACTGTAC CCAGTGCCTC 900
CAGGGGGCTG TGACCACAGA CTATGCAGCT GGCATGGCCA TGGCGGGAGC CGGCGTCATC 960
TCAGGCTTCG ACATGACATC GGAGGCCGCC CTGGCCAAGC TATCGTATGT GCTGGGCCAG 1020
CCAGGGCTGA GCCTGGATGT CAGGAAGGAG CTGCTGACCA AGGACCTTCG GGGGGAGATG 1080
ACGCCACCCT CGGTG 1095



(16) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1928 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: No

(iv) ANTI-SENSE: No

(vi) ORIGINAL SOURCE:

(A) ORGANISM: guinea pig
(F) TISSUE TYPE: liver

(ix) FEATURE:

(A) NAME/KEY: 5'UTR
(B) LOCATION: 1..19
(C) IDENTIFICATION METHODS: S

(A) NAME/KEY: mat peptide
(B) LOCATION: 20..1714
(C) IDENTIFICATION METHODS: S

(A) NAME/KEY: 3'UTR
(B) LOCATION: 1715..1928
(C) IDENTIFICATION METHODS: S
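
The feature locations listed for SEQ ID NO: 15 can be cross-checked against the stated sequence lengths with a small calculation. The sketch below is illustrative only: the helper `feature_length` and the dictionary keys are hypothetical names, and locations are assumed to be inclusive "start..end" ranges.

```python
def feature_length(location: str) -> int:
    """Length in nucleotides of an inclusive 'start..end' location."""
    start, end = (int(part) for part in location.split(".."))
    if end < start:
        raise ValueError("end precedes start")
    return end - start + 1


# Locations as listed in the FEATURE entries above.
features = {
    "5'UTR": "1..19",
    "mat_peptide": "20..1714",
    "3'UTR": "1715..1928",
}
lengths = {name: feature_length(loc) for name, loc in features.items()}

# The three features should tile the full 1928-base sequence.
total = sum(lengths.values())

# The coding region should be a whole number of codons.
codons = lengths["mat_peptide"] // 3
print(lengths, total, codons)
```

Under these assumptions the coding region spans 1695 bases, i.e. 565 codons, which agrees with the length of 565 amino acids given for SEQ ID NO: 5.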

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GAGTGGCTTA GCCGCAGGC ATG GCG CGC GCA TCA GGC TCC GAG AGG CAC  49
                     Met Ala Arg Ala Ser Gly Ser Glu Arg His
                     1               5                   10

CTG CTG CTC ATC TAC ACT GGC GGC ACT TTG GGC ATG CAG AGC AAG GGC  97
Leu Leu Leu Ile Tyr Thr Gly Gly Thr Leu Gly Met Gln Ser Lys Gly
                15                  20                  25

GGG GTG CTC GTC CCC GGC CCA GGC CTG GTC ACT CTG CTG CGG ACC CTG  145
Gly Val Leu Val Pro Gly Pro Gly Leu Val Thr Leu Leu Arg Thr Leu
            30                  35                  40

CCC ATG TTC CAT GAC AAG GAG TTC GCC CAG GCC CAG GGC CTC CCT GAC  193
Pro Met Phe His Asp Lys Glu Phe Ala Gln Ala Gln Gly Leu Pro Asp
        45                  50                  55

CAT GCT CTG GCG CTG CCC CCT GCC AGC CAC GGC CCC AGG GTC CTC TAC  241
His Ala Leu Ala Leu Pro Pro Ala Ser His Gly Pro Arg Val Leu Tyr
    60                  65                  70

ACG GTG CTG GAG TGC CAG CCC CTC TTG GAT TCC AGC GAC ATG ACC ATC  289
Thr Val Leu Glu Cys Gln Pro Leu Leu Asp Ser Ser Asp Met Thr Ile
75                  80                  85                  90

GAT GAT TGG ATT CGC ATA GCC AAG ATC ATA GAG AGG CAC TAT GAG CAG  337
Asp Asp Trp Ile Arg Ile Ala Lys Ile Ile Glu Arg His Tyr Glu Gln
                95                  100                 105

TAC CAA GGC TTT GTG GTT ATC CAC GGC ACC GAC ACC ATG GCC TTT GGG  385
Tyr Gln Gly Phe Val Val Ile His Gly Thr Asp Thr Met Ala Phe Gly
            110                 115                 120

GCC TCC ATG CTG TCC TTC ATG CTG GAA AAC CTG CAC AAA CCA GTC ATC  433
Ala Ser Met Leu Ser Phe Met Leu Glu Asn Leu His Lys Pro Val Ile
        125                 130                 135

CTC ACT GGC GCC CAG GTG CCA ATC CGT GTG CTG TGG AAT GAC GCC CGG  481
Leu Thr Gly Ala Gln Val Pro Ile Arg Val Leu Trp Asn Asp Ala Arg
    140                 145                 150

GAA AAC CTG CTG GGG GCG TTG CTT GTG GCC GGC CAA TAC ATC ATC CCT  529
Glu Asn Leu Leu Gly Ala Leu Leu Val Ala Gly Gln Tyr Ile Ile Pro
155                 160                 165                 170

GAG GTC TGC CTG TTT ATG AAC AGT CAG CTG TTT CGG GGA AAC CGG GTA  577
Glu Val Cys Leu Phe Met Asn Ser Gln Leu Phe Arg Gly Asn Arg Val
                175                 180                 185

ACC AAG GTG GAC TCC CAG AAG TTT GAG GCC TTC TGC TCC CCC AAT CTG  625
Thr Lys Val Asp Ser Gln Lys Phe Glu Ala Phe Cys Ser Pro Asn Leu
            190                 195                 200

TCC CCA CTA GCC ACT GTG GGC GCG GAT GTC ACA ATT GCC TGG GAC CTG  673
Ser Pro Leu Ala Thr Val Gly Ala Asp Val Thr Ile Ala Trp Asp Leu
        205                 210                 215

GTG CGC AAG GTC AAC TGG AAG GAC CCG CTG GTG GTG CAC AGC AAC ATG  721
Val Arg Lys Val Asn Trp Lys Asp Pro Leu Val Val His Ser Asn Met
    220                 225                 230

GAG CAC GAC GTG GCA CTG CTG CGC CTC TAC CCT GGC ATC CCG GCC TCC  769
Glu His Asp Val Ala Leu Leu Arg Leu Tyr Pro Gly Ile Pro Ala Ser
235                 240                 245                 250

CTG GTC CGG GCA TTC CTG CAG CCC CCG CTC AAG GGC GTG GTC CTG GAG  817
Leu Val Arg Ala Phe Leu Gln Pro Pro Leu Lys Gly Val Val Leu Glu
                255                 260                 265

ACC TTC GGC TCT GGC AAC GGG CCG AGC AAG CCC GAC CTG CTG CAG GAG  865
Thr Phe Gly Ser Gly Asn Gly Pro Ser Lys Pro Asp Leu Leu Gln Glu
            270                 275                 280

TTG CGG GCC GCG GCC CAG CGC GGC CTC ATC ATG GTC AAC TGC AGC CAG  913
Leu Arg Ala Ala Ala Gln Arg Gly Leu Ile Met Val Asn Cys Ser Gln
        285                 290                 295

TGC CTG CGG GGG TCT GTG ACC CCG GGC TAT GCC ACG AGC TTG GCG GGC  961
Cys Leu Arg Gly Ser Val Thr Pro Gly Tyr Ala Thr Ser Leu Ala Gly
    300                 305                 310

GCC AAC ATC GTG TCC GGC TTA GAC ATG ACC TCA GAG GCC GCG CTG GCT  1009
Ala Asn Ile Val Ser Gly Leu Asp Met Thr Ser Glu Ala Ala Leu Ala
315                 320                 325                 330

AAG CTG TCC TAC GTG TTG GGC CTG CCG GAG CTG AGC CTG GAG CGC AGG  1057
Lys Leu Ser Tyr Val Leu Gly Leu Pro Glu Leu Ser Leu Glu Arg Arg
                335                 340                 345

CAG GAG CTG CTG GCC AAG GAT CTT CGC GGG GAA ATG ACA CTG CCC ACG  1105
Gln Glu Leu Leu Ala Lys Asp Leu Arg Gly Glu Met Thr Leu Pro Thr
            350                 355                 360

GCA GAC CTG CAC CAG TCC TCT CCG CCG GGC AGC ACA CTG GGG CAA GGT  1153
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15 



Ala Asp Leu His Gln Ser Ser Pro Pro Gly Ser Thr Leu Gly Gln Gly
365 370 375

GTC GCC CGG CTC TTT AGT CTG TTC GGT TGC CAG GAG GAA GAT TCG GTG 1201
Val Ala Arg Leu Phe Ser Leu Phe Gly Cys Gln Glu Glu Asp Ser Val
380 385 390

CAG GAC GCC GTG ATG CCC AGC CTG GCC CTG GCC TTG GCC CAT GCT GGT 1249
Gln Asp Ala Val Met Pro Ser Leu Ala Leu Ala Leu Ala His Ala Gly
395 400 405 410

GAA CTC GAG GCT CTG CAG GCA CTT ATG GAG CTG GGC AGT GAC CTG CGC 1297
Glu Leu Glu Ala Leu Gln Ala Leu Met Glu Leu Gly Ser Asp Leu Arg
415 420 425

CTA AAG GAC TCT AAT GGC CAA ACC CTG TTG CAT GTG GCT GCT CGG AAT 1345
Leu Lys Asp Ser Asn Gly Gln Thr Leu Leu His Val Ala Ala Arg Asn
430 435 440

GGG CGT GAT GGC GTG GTC ACC ATG CTG CTG CAC AGA GGC ATG GAT GTC 1393
Gly Arg Asp Gly Val Val Thr Met Leu Leu His Arg Gly Met Asp Val
445 450 455

AAT GCC CGA GAC CGA GAC GGC CTC AGC CCA CTG CTG TTG GCT GTA CAG 1441
Asn Ala Arg Asp Arg Asp Gly Leu Ser Pro Leu Leu Leu Ala Val Gln
460 465 470

GGC AGG CAT CGG GAA TGC ATC AGG CTG CTG CGG AAG GCT GGG GCC TGC 1489
Gly Arg His Arg Glu Cys Ile Arg Leu Leu Arg Lys Ala Gly Ala Cys
475 480 485 490

CTG TCC CCC CAG GAC CTG AAG GAT GCA GGG ACC GAG CTG TGC AGG CTG 1537
Leu Ser Pro Gln Asp Leu Lys Asp Ala Gly Thr Glu Leu Cys Arg Leu
495 500 505

GCA TCC AGG GCT GAC ATG GAA GGC CTG CAG GCA TGG GGG CAG GCT GGG 1585
Ala Ser Arg Ala Asp Met Glu Gly Leu Gln Ala Trp Gly Gln Ala Gly
510 515 520

GCC GAC CTG CAG CAG CCG GGC TAT GAT GGG CGC AGC GCT CTG TGT GTC 1633
Ala Asp Leu Gln Gln Pro Gly Tyr Asp Gly Arg Ser Ala Leu Cys Val
525 530 535

GCA GAA GCA GCC GGG AAC CAG GAG GTG CTG GCC CTT CTG CGG AAC CTG 1681
Ala Glu Ala Ala Gly Asn Gln Glu Val Leu Ala Leu Leu Arg Asn Leu
540 545 550

GCA CTT GTA GGC CCG GAA GTG CCG CCT GCC ATC TGATCGCCAG CAATCCCGCT 1734
Ala Leu Val Gly Pro Glu Val Pro Pro Ala Ile
555 560 565

GTGGTGTGAG CCACTCCGCC ATCTGCTGCT TTGACCCACT CGAGGGACCC TAGCACACGA 1794
CCCCCCAGCA GGATGCACCC CACTACTTAG AGTATACCCC AGGCTGGCTC AGTGACAAGC 1854
TGCAAAGGTC TTTGTTGGCA GAACAGCAAT AAAGTAACTA CAGAGTGGCC AAAAAAAAAA 1914
AAAAAAAAAA AAAA 1928


(17) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2096 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: No

(iv) ANTI-SENSE: No
(vi) ORIGINAL SOURCE:
(A) ORGANISM: human
(F) TISSUE TYPE: liver

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<iX) FEATURE: 

(A) NAME/KEY: 5'UTR

(B) LOCATION: 1..92

(C) IDENTIFICATION METHODS: S

(A) NAME/KEY: mat peptide

(B) LOCATION: 93..1811

(C) IDENTIFICATION METHODS: S

(A) NAME/KEY: 3'UTR

(B) LOCATION: 1812..2096

(C) IDENTIFICATION METHODS: S
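
The feature locations listed above can be checked for internal consistency: the three regions should tile the 2096 base pairs without gaps, and the mat peptide region should be a whole number of codons. The following is a minimal Python sketch of that arithmetic; the names FEATURES, feature_length and features_are_contiguous are illustrative assumptions and are not part of the specification.

```python
# Feature coordinates as listed for SEQ ID NO: 16 (1-based, inclusive).
FEATURES = {
    "5'UTR": (1, 92),
    "mat peptide": (93, 1811),
    "3'UTR": (1812, 2096),
}
TOTAL_LENGTH = 2096  # base pairs, from the SEQUENCE CHARACTERISTICS entry


def feature_length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def features_are_contiguous(features, total_length):
    """True if the features cover 1..total_length with no gap or overlap."""
    spans = sorted(features.values())
    if spans[0][0] != 1 or spans[-1][1] != total_length:
        return False
    # Each feature must start exactly one position after the previous one ends.
    return all(nxt[0] == prev[1] + 1 for prev, nxt in zip(spans, spans[1:]))


coding_nt = feature_length(*FEATURES["mat peptide"])
codons, remainder = divmod(coding_nt, 3)

print("contiguous:", features_are_contiguous(FEATURES, TOTAL_LENGTH))
print("coding nucleotides:", coding_nt)
print("codons:", codons, "remainder:", remainder)
```

Under these coordinates the mat peptide region spans 1719 nucleotides, that is 573 codons with no remainder, which agrees with the residue numbering of the translated sequence and with the 1719 base pair length given for SEQ ID NO: 18.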

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

CGCCCCGGGC CTCCTCCGCG CAGTCCCTGA GTCCCGCAGG CCCTGCGTCC CCGCTGCACA 60
CCCCCGTCCA CTCCCGTGGT CCCCGGTCCG GC ATG GCG CGC GCG GTG GGG CCC 113
Met Ala Arg Ala Val Gly Pro
1 5

GAG CGG AGG CTG CTG GCC GTC TAC ACC GGC GGC ACC ATT GGC ATG CGG 161
Glu Arg Arg Leu Leu Ala Val Tyr Thr Gly Gly Thr Ile Gly Met Arg
10 15 20

AGT GAG CTC GGC GTG CTT GTG CCC GGG ACG GGC CTG GCT GCC ATC CTG 209
Ser Glu Leu Gly Val Leu Val Pro Gly Thr Gly Leu Ala Ala Ile Leu
25 30 35

AGG ACA CTG CCC ATG TTC CAT GAC GAG GAG CAC GCC CGA GCC CGC GGC 257
Arg Thr Leu Pro Met Phe His Asp Glu Glu His Ala Arg Ala Arg Gly
40 45 50 55

CTC TCT GAG GAC ACC CTG GTG CTA CCC CCG GAC AGC CGC AAC CAG AGG 305
Leu Ser Glu Asp Thr Leu Val Leu Pro Pro Asp Ser Arg Asn Gln Arg
60 65 70

ATC CTC TAC ACC GTG CTG GAG TGC CAG CCC CTC TTC GAC TCC AGT GAC 353
Ile Leu Tyr Thr Val Leu Glu Cys Gln Pro Leu Phe Asp Ser Ser Asp
75 80 85

ATG ACC ATC GCT GAG TGG GTT CGC GTT GCC CAG ACC ATC AAG AGG CAC 401
Met Thr Ile Ala Glu Trp Val Arg Val Ala Gln Thr Ile Lys Arg His
90 95 100

TAC GAG CAG TAC CAC GGC TTT GTG GTC ATC CAC GGC ACC GAC ACC ATG 449
Tyr Glu Gln Tyr His Gly Phe Val Val Ile His Gly Thr Asp Thr Met
105 110 115

GCC TTT GCT GCC TCG ATG CTG TCC TTC ATG CTG GAG AAC CTG CAG AAG 497
Ala Phe Ala Ala Ser Met Leu Ser Phe Met Leu Glu Asn Leu Gln Lys
120 125 130 135

ACT GTC ATC CTC ACT GGG GCC CAG GTG CCC ATC CAT GCC CTG TGG AGC 545
Thr Val Ile Leu Thr Gly Ala Gln Val Pro Ile His Ala Leu Trp Ser
140 145 150

GAC GGC CGT GAG AAC CTG CTG GGG GCA CTG CTC ATG GCT GGC CAG TAT 593
Asp Gly Arg Glu Asn Leu Leu Gly Ala Leu Leu Met Ala Gly Gln Tyr
155 160 165

GTG ATC CCA GAG GTC TGC CTT TTC TTC CAG AAT CAG CTG TTT CGG GGC 641
Val Ile Pro Glu Val Cys Leu Phe Phe Gln Asn Gln Leu Phe Arg Gly
170 175 180

AAC CGG GCA ACC AAG GTA GAC GCT CGG AGG TTC GCA GCT TTC TGC TCC 689
Asn Arg Ala Thr Lys Val Asp Ala Arg Arg Phe Ala Ala Phe Cys Ser
185 190 195

CCG AAC CTG CTG CCT CTG GCC ACA GTG GGT GCT GAC ATC ACA ATC AAC 737
Pro Asn Leu Leu Pro Leu Ala Thr Val Gly Ala Asp Ile Thr Ile Asn
200 205 210 215

AGG GAG CTG GTG CGG AAG GTG GAC GGG AAG GCT GGG CTG GTG GTG CAC 785
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20 



Arg Glu Leu Val Arg Lys Val Asp Gly Lys Ala Gly Leu Val Val His
220 225 230

AGC AGC ATG GAG CAG GAC GTG GGC CTG CTG CGC CTC TAC CCT GGG ATC 833
Ser Ser Met Glu Gln Asp Val Gly Leu Leu Arg Leu Tyr Pro Gly Ile
235 240 245

CCT GCC GCC CTG GTT CGG GCC TTC TTG CAG CCT CCC CTG AAG GGC GTG 881
Pro Ala Ala Leu Val Arg Ala Phe Leu Gln Pro Pro Leu Lys Gly Val
250 255 260

GTC ATG GAG ACC TTC GGT TCA GGG AAC GGA CCC ACC AAG CCC GAC CTG 929
Val Met Glu Thr Phe Gly Ser Gly Asn Gly Pro Thr Lys Pro Asp Leu
265 270 275

CTG CAG GAG CTG CGG GTG GCC ACC GAG CGC GGC CTG GTC ATC GTC AAC 977
Leu Gln Glu Leu Arg Val Ala Thr Glu Arg Gly Leu Val Ile Val Asn
280 285 290 295

TGT ACC CAC TGC CTC CAG GGG GCT GTG ACC ACA GAC TAT GCA GCT GGC 1025
Cys Thr His Cys Leu Gln Gly Ala Val Thr Thr Asp Tyr Ala Ala Gly
300 305 310

ATG GCC ATG GCG GGA GCC GGC GTC ATC TCA GGC TTC GAC ATG ACA TCG 1073
Met Ala Met Ala Gly Ala Gly Val Ile Ser Gly Phe Asp Met Thr Ser
315 320 325

GAG GCC GCC CTG GCC AAG CTA TCG TAT GTG CTG GGC CAG CCA GGG CTG 1121
Glu Ala Ala Leu Ala Lys Leu Ser Tyr Val Leu Gly Gln Pro Gly Leu
330 335 340

AGC CTG GAT GTC AGG AAG GAG CTG CTG ACC AAG GAC CTT CGG GGG GAG 1169
Ser Leu Asp Val Arg Lys Glu Leu Leu Thr Lys Asp Leu Arg Gly Glu
345 350 355

ATG ACG CCA CCC TCG GTG GAA GAG CGC CGG CCC TCA CTG CAG GGC AAC 1217
Met Thr Pro Pro Ser Val Glu Glu Arg Arg Pro Ser Leu Gln Gly Asn
360 365 370 375

ACG CTG GGC GGT GGG GTC TCC TGG CTC CTC AGT CTG AGC GGC AGC CAG 1265
Thr Leu Gly Gly Gly Val Ser Trp Leu Leu Ser Leu Ser Gly Ser Gln
380 385 390

GAG GCA GAT GCC CTG CGG AAT GCC CTG GTG CCC AGC CTG GCC TGT GCT 1313
Glu Ala Asp Ala Leu Arg Asn Ala Leu Val Pro Ser Leu Ala Cys Ala
395 400 405

GCT GCC CAC GCC GGT GAC GTG GAG GCG CTG CAG GCG CTT GTG GAG CTG 1361
Ala Ala His Ala Gly Asp Val Glu Ala Leu Gln Ala Leu Val Glu Leu
410 415 420

GGC AGT GAC CTG GGC CTG GTG GAC TTT AAC GGC CAA ACC CCA CTG CAC 1409
Gly Ser Asp Leu Gly Leu Val Asp Phe Asn Gly Gln Thr Pro Leu His
425 430 435

GCG GCC GCC CGG GGA GGC CAC ACA GAG GCA GTC ACC ATG CTG CTG CAG 1457
Ala Ala Ala Arg Gly Gly His Thr Glu Ala Val Thr Met Leu Leu Gln
440 445 450 455

AGA GGT GTG GAC GTG AAC ACC CGG GAC ACG GAT GGC TTC AGC CCG CTG 1505
Arg Gly Val Asp Val Asn Thr Arg Asp Thr Asp Gly Phe Ser Pro Leu
460 465 470

CTG CTG GCC GTG CGG GGC AGG CAT CCG GGT GTC ATT GGG TTG CTG CGG 1553
Leu Leu Ala Val Arg Gly Arg His Pro Gly Val Ile Gly Leu Leu Arg
475 480 485

GAA GCC GGG GCC TCC CTG TCC ACC CAG GAG CTG GAG GAA GCA GGG ACG 1601
Glu Ala Gly Ala Ser Leu Ser Thr Gln Glu Leu Glu Glu Ala Gly Thr
490 495 500

GAG CTG TGC AGG CTG GCA TAC AGG GCC GAC CTC GAA GGC CTG CAG GTG 1649
Glu Leu Cys Arg Leu Ala Tyr Arg Ala Asp Leu Glu Gly Leu Gln Val
505 510 515

TGG TGG CAG GCA GGG GCT GAC CTG GGG CAG CCG GGC TAT GAC GGG CAC 1697
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Trp 


Trp Gln Ala Gly Ala Asp Leu Gly Gln Pro Gly Tyr Asp Gly His
520 525 530 535

AGC GCC CTG CAC GTC GCA GAG GCA GCC GGG AAC CTG GCA GTG GTG GCC 1745
Ser Ala Leu His Val Ala Glu Ala Ala Gly Asn Leu Ala Val Val Ala
540 545 550

TTT CTA CAG AGC CTG GAG GGT GCG GTT GGT GCC CAG GCC CCA TGC CCA 1793
Phe Leu Gln Ser Leu Glu Gly Ala Val Gly Ala Gln Ala Pro Cys Pro
555 560 565

GAA GTG CTG CCT GGT GTC TAACCTGAAG GCGTCCTGCT GCAGTATAAG 1841
Glu Val Leu Pro Gly Val
570

CCATTCCTTC CTCCCATGAC CTGCTGGAGG GGTCTCAGGC ATGACCCCAC TGCTGGGGCT 1901
GCTTCCCAGC CTGCTCTCAT GTAAAGCCTG AAGGCCTTTG TTGGGCAGGA CGGCAATAAA 1961
GTCTCTGACA TCCCCTCACC AGGTCTGTAC AGCCTGGCTC TGAGAGGCTC TGTCTGGGTC 2021
CGGGACTGTG AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2081
AAAAAAAAAA AAAAA 2096

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(18) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1695 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: No

(iv) ANTI-SENSE: No
(vi) ORIGINAL SOURCE:

(A) ORGANISM: guinea pig

(F) TISSUE TYPE: liver
(ix) FEATURE:

(A) NAME/KEY: mat peptide

(B) LOCATION: 1..1695

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:












ATG GCG CGC GCA TCA GGC TCC GAG AGG CAC CTG CTG CTC ATC TAC ACT 48
Met Ala Arg Ala Ser Gly Ser Glu Arg His Leu Leu Leu Ile Tyr Thr
1 5 10 15

GGC GGC ACT TTG GGC ATG CAG AGC AAG GGC GGG GTG CTC GTC CCC GGC 96
Gly Gly Thr Leu Gly Met Gln Ser Lys Gly Gly Val Leu Val Pro Gly
20 25 30

CCA GGC CTG GTC ACT CTG CTG CGG ACC CTG CCC ATG TTC CAT GAC AAG 144
Pro Gly Leu Val Thr Leu Leu Arg Thr Leu Pro Met Phe His Asp Lys
35 40 45

GAG TTC GCC CAG GCC CAG GGC CTC CCT GAC CAT GCT CTG GCG CTG CCC 192
Glu Phe Ala Gln Ala Gln Gly Leu Pro Asp His Ala Leu Ala Leu Pro
50 55 60

CCT GCC AGC CAC GGC CCC AGG GTC CTC TAC ACG GTG CTG GAG TGC CAG 240
Pro Ala Ser His Gly Pro Arg Val Leu Tyr Thr Val Leu Glu Cys Gln
65 70 75 80

CCC CTC TTG GAT TCC AGC GAC ATG ACC ATC GAT GAT TGG ATT CGC ATA 288
Pro Leu Leu Asp Ser Ser Asp Met Thr Ile Asp Asp Trp Ile Arg Ile
85 90 95

GCC AAG ATC ATA GAG AGG CAC TAT GAG CAG TAC CAA GGC TTT GTG GTT 336
Ala Lys Ile Ile Glu Arg His Tyr Glu Gln Tyr Gln Gly Phe Val Val
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100 105 110 

ATC CAC GGC ACC GAC ACC ATG GCC TTT GGG GCC TCC ATG CTG TCC TTC 384
Ile His Gly Thr Asp Thr Met Ala Phe Gly Ala Ser Met Leu Ser Phe
115 120 125

ATG CTG GAA AAC CTG CAC AAA CCA GTC ATC CTC ACT GGC GCC CAG GTG 432
Met Leu Glu Asn Leu His Lys Pro Val Ile Leu Thr Gly Ala Gln Val
130 135 140

CCA ATC CGT GTG CTG TGG AAT GAC GCC CGG GAA AAC CTG CTG GGG GCG 480
Pro Ile Arg Val Leu Trp Asn Asp Ala Arg Glu Asn Leu Leu Gly Ala
145 150 155 160

TTG CTT GTG GCC GGC CAA TAC ATC ATC CCT GAG GTC TGC CTG TTT ATG 528
Leu Leu Val Ala Gly Gln Tyr Ile Ile Pro Glu Val Cys Leu Phe Met
165 170 175

AAC AGT CAG CTG TTT CGG GGA AAC CGG GTA ACC AAG GTG GAC TCC CAG 576
Asn Ser Gln Leu Phe Arg Gly Asn Arg Val Thr Lys Val Asp Ser Gln
180 185 190

AAG TTT GAG GCC TTC TGC TCC CCC AAT CTG TCC CCA CTA GCC ACT GTG 624
Lys Phe Glu Ala Phe Cys Ser Pro Asn Leu Ser Pro Leu Ala Thr Val
195 200 205

GGC GCG GAT GTC ACA ATT GCC TGG GAC CTG GTG CGC AAG GTC AAC TGG 672
Gly Ala Asp Val Thr Ile Ala Trp Asp Leu Val Arg Lys Val Asn Trp
210 215 220

AAG GAC CCG CTG GTG GTG CAC AGC AAC ATG GAG CAC GAC GTG GCA CTG 720
Lys Asp Pro Leu Val Val His Ser Asn Met Glu His Asp Val Ala Leu
225 230 235 240

CTG CGC CTC TAC CCT GGC ATC CCG GCC TCC CTG GTC CGG GCA TTC CTG 768
Leu Arg Leu Tyr Pro Gly Ile Pro Ala Ser Leu Val Arg Ala Phe Leu
245 250 255

CAG CCC CCG CTC AAG GGC GTG GTC CTG GAG ACC TTC GGC TCT GGC AAC 816
Gln Pro Pro Leu Lys Gly Val Val Leu Glu Thr Phe Gly Ser Gly Asn
260 265 270

GGG CCG AGC AAG CCC GAC CTG CTG CAG GAG TTG CGG GCC GCG GCC CAG 864
Gly Pro Ser Lys Pro Asp Leu Leu Gln Glu Leu Arg Ala Ala Ala Gln
275 280 285

CGC GGC CTC ATC ATG GTC AAC TGC AGC CAG TGC CTG CGG GGG TCT GTG 912
Arg Gly Leu Ile Met Val Asn Cys Ser Gln Cys Leu Arg Gly Ser Val
290 295 300

ACC CCG GGC TAT GCC ACG AGC TTG GCG GGC GCC AAC ATC GTG TCC GGC 960
Thr Pro Gly Tyr Ala Thr Ser Leu Ala Gly Ala Asn Ile Val Ser Gly
305 310 315 320

TTA GAC ATG ACC TCA GAG GCC GCG CTG GCT AAG CTG TCC TAC GTG TTG 1008
Leu Asp Met Thr Ser Glu Ala Ala Leu Ala Lys Leu Ser Tyr Val Leu
325 330 335

GGC CTG CCG GAG CTG AGC CTG GAG CGC AGG CAG GAG CTG CTG GCC AAG 1056
Gly Leu Pro Glu Leu Ser Leu Glu Arg Arg Gln Glu Leu Leu Ala Lys
340 345 350

GAT CTT CGC GGG GAA ATG ACA CTG CCC ACG GCA GAC CTG CAC CAG TCC 1104
Asp Leu Arg Gly Glu Met Thr Leu Pro Thr Ala Asp Leu His Gln Ser
355 360 365

TCT CCG CCG GGC AGC ACA CTG GGG CAA GGT GTC GCC CGG CTC TTT AGT 1152
Ser Pro Pro Gly Ser Thr Leu Gly Gln Gly Val Ala Arg Leu Phe Ser
370 375 380

CTG TTC GGT TGC CAG GAG GAA GAT TCG GTG CAG GAC GCC GTG ATG CCC 1200
Leu Phe Gly Cys Gln Glu Glu Asp Ser Val Gln Asp Ala Val Met Pro
385 390 395 400

AGC CTG GCC CTG GCC TTG GCC CAT GCT GGT GAA CTC GAG GCT CTG CAG 1248
Ser Leu Ala Leu Ala Leu Ala His Ala Gly Glu Leu Glu Ala Leu Gln
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A ft C 

405 410 415

GCA CTT ATG GAG CTG GGC AGT GAC CTG CGC CTA AAG GAC TCT AAT GGC 1296
Ala Leu Met Glu Leu Gly Ser Asp Leu Arg Leu Lys Asp Ser Asn Gly
420 425 430

CAA ACC CTG TTG CAT GTG GCT GCT CGG AAT GGG CGT GAT GGC GTG GTC 1344
Gln Thr Leu Leu His Val Ala Ala Arg Asn Gly Arg Asp Gly Val Val
435 440 445

ACC ATG CTG CTG CAC AGA GGC ATG GAT GTC AAT GCC CGA GAC CGA GAC 1392
Thr Met Leu Leu His Arg Gly Met Asp Val Asn Ala Arg Asp Arg Asp
450 455 460

GGC CTC AGC CCA CTG CTG TTG GCT GTA CAG GGC AGG CAT CGG GAA TGC 1440
Gly Leu Ser Pro Leu Leu Leu Ala Val Gln Gly Arg His Arg Glu Cys
465 470 475 480

ATC AGG CTG CTG CGG AAG GCT GGG GCC TGC CTG TCC CCC CAG GAC CTG 1488
Ile Arg Leu Leu Arg Lys Ala Gly Ala Cys Leu Ser Pro Gln Asp Leu
485 490 495

AAG GAT GCA GGG ACC GAG CTG TGC AGG CTG GCA TCC AGG GCT GAC ATG 1536
Lys Asp Ala Gly Thr Glu Leu Cys Arg Leu Ala Ser Arg Ala Asp Met
500 505 510

GAA GGC CTG CAG GCA TGG GGG CAG GCT GGG GCC GAC CTG CAG CAG CCG 1584
Glu Gly Leu Gln Ala Trp Gly Gln Ala Gly Ala Asp Leu Gln Gln Pro
515 520 525

GGC TAT GAT GGG CGC AGC GCT CTG TGT GTC GCA GAA GCA GCC GGG AAC 1632
Gly Tyr Asp Gly Arg Ser Ala Leu Cys Val Ala Glu Ala Ala Gly Asn
530 535 540

CAG GAG GTG CTG GCC CTT CTG CGG AAC CTG GCA CTT GTA GGC CCG GAA 1680
Gln Glu Val Leu Ala Leu Leu Arg Asn Leu Ala Leu Val Gly Pro Glu
545 550 555 560

GTG CCG CCT GCC ATC 1695
Val Pro Pro Ala Ile
565
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(19) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1719 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: No

(iv) ANTI-SENSE: No
(vi) ORIGINAL SOURCE:

(A) ORGANISM: human

(F) TISSUE TYPE: liver
(ix) FEATURE:

(A) NAME/KEY: mat peptide

(B) LOCATION: 1..1719

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

ATG GCG CGC GCG GTG GGG CCC GAG CGG AGG CTG CTG GCC GTC TAC ACC 48
Met Ala Arg Ala Val Gly Pro Glu Arg Arg Leu Leu Ala Val Tyr Thr
1 5 10 15

GGC GGC ACC ATT GGC ATG CGG AGT GAG CTC GGC GTG CTT GTG CCC GGG 96
Gly Gly Thr Ile Gly Met Arg Ser Glu Leu Gly Val Leu Val Pro Gly
20 25 30
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70 



ACG GGC CTG GCT GCC ATC CTG AGG ACA CTG CCC ATG TTC CAT GAC GAG 144
Thr Gly Leu Ala Ala Ile Leu Arg Thr Leu Pro Met Phe His Asp Glu
35 40 45

GAG CAC GCC CGA GCC CGC GGC CTC TCT GAG GAC ACC CTG GTG CTA CCC 192
Glu His Ala Arg Ala Arg Gly Leu Ser Glu Asp Thr Leu Val Leu Pro
50 55 60

CCG GAC AGC CGC AAC CAG AGG ATC CTC TAC ACC GTG CTG GAG TGC CAG 240
Pro Asp Ser Arg Asn Gln Arg Ile Leu Tyr Thr Val Leu Glu Cys Gln
65 70 75 80

CCC CTC TTC GAC TCC AGT GAC ATG ACC ATC GCT GAG TGG GTT CGC GTT 288
Pro Leu Phe Asp Ser Ser Asp Met Thr Ile Ala Glu Trp Val Arg Val
85 90 95

GCC CAG ACC ATC AAG AGG CAC TAC GAG CAG TAC CAC GGC TTT GTG GTC 336
Ala Gln Thr Ile Lys Arg His Tyr Glu Gln Tyr His Gly Phe Val Val
100 105 110

ATC CAC GGC ACC GAC ACC ATG GCC TTT GCT GCC TCG ATG CTG TCC TTC 384
Ile His Gly Thr Asp Thr Met Ala Phe Ala Ala Ser Met Leu Ser Phe
115 120 125

ATG CTG GAG AAC CTG CAG AAG ACT GTC ATC CTC ACT GGG GCC CAG GTG 432
Met Leu Glu Asn Leu Gln Lys Thr Val Ile Leu Thr Gly Ala Gln Val
130 135 140

CCC ATC CAT GCC CTG TGG AGC GAC GGC CGT GAG AAC CTG CTG GGG GCA 480
Pro Ile His Ala Leu Trp Ser Asp Gly Arg Glu Asn Leu Leu Gly Ala
145 150 155 160

CTG CTC ATG GCT GGC CAG TAT GTG ATC CCA GAG GTC TGC CTT TTC TTC 528
Leu Leu Met Ala Gly Gln Tyr Val Ile Pro Glu Val Cys Leu Phe Phe
165 170 175

CAG AAT CAG CTG TTT CGG GGC AAC CGG GCA ACC AAG GTA GAC GCT CGG 576
Gln Asn Gln Leu Phe Arg Gly Asn Arg Ala Thr Lys Val Asp Ala Arg
180 185 190

AGG TTC GCA GCT TTC TGC TCC CCG AAC CTG CTG CCT CTG GCC ACA GTG 624
Arg Phe Ala Ala Phe Cys Ser Pro Asn Leu Leu Pro Leu Ala Thr Val
195 200 205

GGT GCT GAC ATC ACA ATC AAC AGG GAG CTG GTG CGG AAG GTG GAC GGG 672
Gly Ala Asp Ile Thr Ile Asn Arg Glu Leu Val Arg Lys Val Asp Gly
210 215 220

AAG GCT GGG CTG GTG GTG CAC AGC AGC ATG GAG CAG GAC GTG GGC CTG 720
Lys Ala Gly Leu Val Val His Ser Ser Met Glu Gln Asp Val Gly Leu
225 230 235 240

CTG CGC CTC TAC CCT GGG ATC CCT GCC GCC CTG GTT CGG GCC TTC TTG 768
Leu Arg Leu Tyr Pro Gly Ile Pro Ala Ala Leu Val Arg Ala Phe Leu
245 250 255

CAG CCT CCC CTG AAG GGC GTG GTC ATG GAG ACC TTC GGT TCA GGG AAC 816
Gln Pro Pro Leu Lys Gly Val Val Met Glu Thr Phe Gly Ser Gly Asn
260 265 270

GGA CCC ACC AAG CCC GAC CTG CTG CAG GAG CTG CGG GTG GCC ACC GAG 864
Gly Pro Thr Lys Pro Asp Leu Leu Gln Glu Leu Arg Val Ala Thr Glu
275 280 285

CGC GGC CTG GTC ATC GTC AAC TGT ACC CAC TGC CTC CAG GGG GCT GTG 912
Arg Gly Leu Val Ile Val Asn Cys Thr His Cys Leu Gln Gly Ala Val
290 295 300

ACC ACA GAC TAT GCA GCT GGC ATG GCC ATG GCG GGA GCC GGC GTC ATC 960
Thr Thr Asp Tyr Ala Ala Gly Met Ala Met Ala Gly Ala Gly Val Ile
305 310 315 320

TCA GGC TTC GAC ATG ACA TCG GAG GCC GCC CTG GCC AAG CTA TCG TAT 1008
Ser Gly Phe Asp Met Thr Ser Glu Ala Ala Leu Ala Lys Leu Ser Tyr
325 330 335
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GTG 


CTG GGC CAG CCA GGG CTG AGC CTG GAT GTC AGG AAG GAG CTG CTG 1056
Val Leu Gly Gln Pro Gly Leu Ser Leu Asp Val Arg Lys Glu Leu Leu
340 345 350

ACC AAG GAC CTT CGG GGG GAG ATG ACG CCA CCC TCG GTG GAA GAG CGC 1104
Thr Lys Asp Leu Arg Gly Glu Met Thr Pro Pro Ser Val Glu Glu Arg
355 360 365

CGG CCC TCA CTG CAG GGC AAC ACG CTG GGC GGT GGG GTC TCC TGG CTC 1152
Arg Pro Ser Leu Gln Gly Asn Thr Leu Gly Gly Gly Val Ser Trp Leu
370 375 380

CTC AGT CTG AGC GGC AGC CAG GAG GCA GAT GCC CTG CGG AAT GCC CTG 1200
Leu Ser Leu Ser Gly Ser Gln Glu Ala Asp Ala Leu Arg Asn Ala Leu
385 390 395 400

GTG CCC AGC CTG GCC TGT GCT GCT GCC CAC GCC GGT GAC GTG GAG GCG 1248
Val Pro Ser Leu Ala Cys Ala Ala Ala His Ala Gly Asp Val Glu Ala
405 410 415

CTG CAG GCG CTT GTG GAG CTG GGC AGT GAC CTG GGC CTG GTG GAC TTT 1296
Leu Gln Ala Leu Val Glu Leu Gly Ser Asp Leu Gly Leu Val Asp Phe
420 425 430

AAC GGC CAA ACC CCA CTG CAC GCG GCC GCC CGG GGA GGC CAC ACA GAG 1344
Asn Gly Gln Thr Pro Leu His Ala Ala Ala Arg Gly Gly His Thr Glu
435 440 445

GCA GTC ACC ATG CTG CTG CAG AGA GGT GTG GAC GTG AAC ACC CGG GAC 1392
Ala Val Thr Met Leu Leu Gln Arg Gly Val Asp Val Asn Thr Arg Asp
450 455 460

ACG GAT GGC TTC AGC CCG CTG CTG CTG GCC GTG CGG GGC AGG CAT CCG 1440
Thr Asp Gly Phe Ser Pro Leu Leu Leu Ala Val Arg Gly Arg His Pro
465 470 475 480

GGT GTC ATT GGG TTG CTG CGG GAA GCC GGG GCC TCC CTG TCC ACC CAG 1488
Gly Val Ile Gly Leu Leu Arg Glu Ala Gly Ala Ser Leu Ser Thr Gln
485 490 495

GAG CTG GAG GAA GCA GGG ACG GAG CTG TGC AGG CTG GCA TAC AGG GCC 1536
Glu Leu Glu Glu Ala Gly Thr Glu Leu Cys Arg Leu Ala Tyr Arg Ala
500 505 510

GAC CTC GAA GGC CTG CAG GTG TGG TGG CAG GCA GGG GCT GAC CTG GGG 1584
Asp Leu Glu Gly Leu Gln Val Trp Trp Gln Ala Gly Ala Asp Leu Gly
515 520 525

CAG CCG GGC TAT GAC GGG CAC AGC GCC CTG CAC GTC GCA GAG GCA GCC 1632
Gln Pro Gly Tyr Asp Gly His Ser Ala Leu His Val Ala Glu Ala Ala
530 535 540

GGG AAC CTG GCA GTG GTG GCC TTT CTA CAG AGC CTG GAG GGT GCG GTT 1680
Gly Asn Leu Ala Val Val Ala Phe Leu Gln Ser Leu Glu Gly Ala Val
545 550 555 560

GGT GCC CAG GCC CCA TGC CCA GAA GTG CTG CCT GGT GTC 1719
Gly Ala Gln Ala Pro Cys Pro Glu Val Leu Pro Gly Val
565 570 573



Claims

1. A polypeptide which originates from a mammal, having L-asparaginase activity.

2. The polypeptide of claim 1, which is obtainable by the expression of a gene originating from a mammal.

3. The polypeptide of claim 1, which has amino acid sequences of SEQ ID NOs: 1 to 3 (where the symbol "Xaa" means "glutamine" or "threonine");






EP0 811 687 A2 






SEQ ID NO: 1: 

Thr Gly Gly Thr 
1 



SEQ ID NO: 2: 

His Gly Thr Asp Thr 
1 5 



SEQ ID NO: 3: 

Gln Cys Leu Xaa Gly.
1 5 



4. The polypeptide of claim 1, which has an amino acid sequence selected from the group consisting of SEQ ID NOs: 4 to 9 and amino acid sequences homologous thereto;

SEQ ID NO: 4:

Met Ala Arg Ala Ser Gly Ser Glu Arg His Leu Leu Leu Ile Tyr Thr
1 5 10 15

Gly Gly Thr Leu Gly Met Gln Ser Lys Gly Gly Val Leu Val Pro Gly
20 25 30

Pro Gly Leu Val Thr Leu Leu Arg Thr Leu Pro Met Phe His Asp Lys
35 40 45

Glu Phe Ala Gln Ala Gln Gly Leu Pro Asp His Ala Leu Ala Leu Pro
50 55 60
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20 



Pro Ala Ser His Gly Pro Arg Val Leu Tyr Thr Val Leu Glu Cys Gln
65 70 75 80

Pro Leu Leu Asp Ser Ser Asp Met Thr Ile Asp Asp Trp Ile Arg Ile
85 90 95

Ala Lys Ile Ile Glu Arg His Tyr Glu Gln Tyr Gln Gly Phe Val Val
100 105 110

Ile His Gly Thr Asp Thr Met Ala Phe Gly Ala Ser Met Leu Ser Phe
115 120 125

Met Leu Glu Asn Leu His Lys Pro Val Ile Leu Thr Gly Ala Gln Val
130 135 140

Pro Ile Arg Val Leu Trp Asn Asp Ala Arg Glu Asn Leu Leu Gly Ala
145 150 155 160

Leu Leu Val Ala Gly Gln Tyr Ile Ile Pro Glu Val Cys Leu Phe Met
165 170 175

Asn Ser Gln Leu Phe Arg Gly Asn Arg Val Thr Lys Val Asp Ser Gln
180 185 190

Lys Phe Glu Ala Phe Cys Ser Pro Asn Leu Ser Pro Leu Ala Thr Val
195 200 205

Gly Ala Asp Val Thr Ile Ala Trp Asp Leu Val Arg Lys Val Asn Trp
210 215 220

Lys Asp Pro Leu Val Val His Ser Asn Met Glu His Asp Val Ala Leu
225 230 235 240

Leu Arg Leu Tyr Pro Gly Ile Pro Ala Ser Leu Val Arg Ala Phe Leu
245 250 255

Gln Pro Pro Leu Lys Gly Val Val Leu Glu Thr Phe Gly Ser Gly Asn
260 265 270

Gly Pro Ser Lys Pro Asp Leu Leu Gln Glu Leu Arg Ala Ala Ala Gln
275 280 285

Arg Gly Leu Ile Met Val Asn Cys Ser Gln Cys Leu Arg Gly Ser Val
290 295 300

Thr Pro Gly Tyr Ala Thr Ser Leu Ala Gly Ala Asn Ile Val Ser Gly
305 310 315 320

Leu Asp Met Thr Ser Glu Ala Ala Leu Ala Lys Leu Ser Tyr Val Leu
325 330 335

Gly Leu Pro Glu Leu Ser Leu Glu Arg Arg Gln Glu Leu Leu Ala Lys
340 345 350

Asp Leu Arg Gly Glu Met Thr Leu Pro Thr Ala
355 360 363

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SEQ ID NO: 5: 

Met Ala Arg Ala Ser Gly Ser Glu Arg His Leu Leu Leu Ile Tyr Thr
1               5                   10                  15
Gly Gly Thr Leu Gly Met Gln Ser Lys Gly Gly Val Leu Val Pro Gly
            20                  25                  30
Pro Gly Leu Val Thr Leu Leu Arg Thr Leu Pro Met Phe His Asp Lys
        35                  40                  45
Glu Phe Ala Gln Ala Gln Gly Leu Pro Asp His Ala Leu Ala Leu Pro
    50                  55                  60
Pro Ala Ser His Gly Pro Arg Val Leu Tyr Thr Val Leu Glu Cys Gln
65                  70                  75                  80
Pro Leu Leu Asp Ser Ser Asp Met Thr Ile Asp Asp Trp Ile Arg Ile
                85                  90                  95
Ala Lys Ile Ile Glu Arg His Tyr Glu Gln Tyr Gln Gly Phe Val Val
            100                 105                 110
Ile His Gly Thr Asp Thr Met Ala Phe Gly Ala Ser Met Leu Ser Phe
        115                 120                 125
Met Leu Glu Asn Leu His Lys Pro Val Ile Leu Thr Gly Ala Gln Val
    130                 135                 140
Pro Ile Arg Val Leu Trp Asn Asp Ala Arg Glu Asn Leu Leu Gly Ala
145                 150                 155                 160
Leu Leu Val Ala Gly Gln Tyr Ile Ile Pro Glu Val Cys Leu Phe Met
                165                 170                 175
Asn Ser Gln Leu Phe Arg Gly Asn Arg Val Thr Lys Val Asp Ser Gln
            180                 185                 190
Lys Phe Glu Ala Phe Cys Ser Pro Asn Leu Ser Pro Leu Ala Thr Val
        195                 200                 205
Gly Ala Asp Val Thr Ile Ala Trp Asp Leu Val Arg Lys Val Asn Trp
    210                 215                 220
Lys Asp Pro Leu Val Val His Ser Asn Met Glu His Asp Val Ala Leu
225                 230                 235                 240
Leu Arg Leu Tyr Pro Gly Ile Pro Ala Ser Leu Val Arg Ala Phe Leu
                245                 250                 255
Gln Pro Pro Leu Lys Gly Val Val Leu Glu Thr Phe Gly Ser Gly Asn
            260                 265                 270
Gly Pro Ser Lys Pro Asp Leu Leu Gln Glu Leu Arg Ala Ala Ala Gln
        275                 280                 285
Arg Gly Leu Ile Met Val Asn Cys Ser Gln Cys Leu Arg Gly Ser Val
    290                 295                 300
Thr Pro Gly Tyr Ala Thr Ser Leu Ala Gly Ala Asn Ile Val Ser Gly
305                 310                 315                 320
Leu Asp Met Thr Ser Glu Ala Ala Leu Ala Lys Leu Ser Tyr Val Leu
                325                 330                 335
Gly Leu Pro Glu Leu Ser Leu Glu Arg Arg Gln Glu Leu Leu Ala Lys
            340                 345                 350
Asp Leu Arg Gly Glu Met Thr Leu Pro Thr Ala Asp Leu His Gln Ser
        355                 360                 365
Ser Pro Pro Gly Ser Thr Leu Gly Gln Gly Val Ala Arg Leu Phe Ser
    370                 375                 380
Leu Phe Gly Cys Gln Glu Glu Asp Ser Val Gln Asp Ala Val Met Pro
385                 390                 395                 400
Ser Leu Ala Leu Ala Leu Ala His Ala Gly Glu Leu Glu Ala Leu Gln
                405                 410                 415
Ala Leu Met Glu Leu Gly Ser Asp Leu Arg Leu Lys Asp Ser Asn Gly
            420                 425                 430
Gln Thr Leu Leu His Val Ala Ala Arg Asn Gly Arg Asp Gly Val Val
        435                 440                 445
Thr Met Leu Leu His Arg Gly Met Asp Val Asn Ala Arg Asp Arg Asp
    450                 455                 460
Gly Leu Ser Pro Leu Leu Leu Ala Val Gln Gly Arg His Arg Glu Cys
465                 470                 475                 480
Ile Arg Leu Leu Arg Lys Ala Gly Ala Cys Leu Ser Pro Gln Asp Leu
                485                 490                 495
Lys Asp Ala Gly Thr Glu Leu Cys Arg Leu Ala Ser Arg Ala Asp Met
            500                 505                 510
Glu Gly Leu Gln Ala Trp Gly Gln Ala Gly Ala Asp Leu Gln Gln Pro
        515                 520                 525
Gly Tyr Asp Gly Arg Ser Ala Leu Cys Val Ala Glu Ala Ala Gly Asn
    530                 535                 540
Gln Glu Val Leu Ala Leu Leu Arg Asn Leu Ala Leu Val Gly Pro Glu
545                 550                 555                 560
Val Pro Pro Ala Ile
                565

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SEQ ID NO: 6: 



Met Ala Arg Ala Val Gly Pro Glu Arg Arg Leu Leu Ala Val Tyr Thr
1               5                   10                  15
Gly Gly Thr Ile Gly Met Arg Ser Glu Leu Gly Val Leu Val Pro Gly
            20                  25                  30
Thr Gly Leu Ala Ala Ile Leu Arg Thr Leu Pro Met Phe His Asp Glu
        35                  40                  45
Glu His Ala Arg Ala Arg Gly Leu Ser Glu Asp Thr Leu Val Leu Pro
    50                  55                  60
Pro Asp Ser Arg Asn Gln Arg Ile Leu Tyr Thr Val Leu Glu Cys Gln
65                  70                  75                  80
Pro Leu Phe Asp Ser Ser Asp Met Thr Ile Ala Glu Trp Val Arg Val
                85                  90                  95
Ala Gln Thr Ile Lys Arg His Tyr Glu Gln Tyr His Gly Phe Val Val
            100                 105                 110
Ile His Gly Thr Asp Thr Met Ala Phe Ala Ala Ser Met Leu Ser Phe
        115                 120                 125
Met Leu Glu Asn Leu Gln Lys Thr Val Ile Leu Thr Gly Ala Gln Val
    130                 135                 140
Pro Ile His Ala Leu Trp Ser Asp Gly Arg Glu Asn Leu Leu Gly Ala
145                 150                 155                 160
Leu Leu Met Ala Gly Gln Tyr Val Ile Pro Glu Val Cys Leu Phe Phe
                165                 170                 175
Gln Asn Gln Leu Phe Arg Gly Asn Arg Ala Thr Lys Val Asp Ala Arg
            180                 185                 190
Arg Phe Ala Ala Phe Cys Ser Pro Asn Leu Leu Pro Leu Ala Thr Val
        195                 200                 205
Gly Ala Asp Ile Thr Ile Asn Arg Glu Leu Val Arg Lys Val Asp Gly
    210                 215                 220
Lys Ala Gly Leu Val Val His Ser Ser Met Glu Gln Asp Val Gly Leu
225                 230                 235                 240
Leu Arg Leu Tyr Pro Gly Ile Pro Ala Ala Leu Val Arg Ala Phe Leu
                245                 250                 255
Gln Pro Pro Leu Lys Gly Val Val Met Glu Thr Phe Gly Ser Gly Asn
            260                 265                 270
Gly Pro Thr Lys Pro Asp Leu Leu Gln Glu Leu Arg Val Ala Thr Glu
        275                 280                 285
Arg Gly Leu Val Ile Val Asn Cys Thr Gln Cys Leu Arg Gly Ala Val
    290                 295                 300
Thr Thr Asp Tyr Ala Ala Gly Met Ala Met Ala Gly Ala Asn Val Ile
305                 310                 315                 320
Ser Gly Phe Asp Met Thr Ser Glu Ala Ala Leu Ala Lys Leu Ser Tyr
                325                 330                 335
Val Leu Gly Gln Pro Gly Leu Ser Leu Asp Val Arg Lys Glu Leu Leu
            340                 345                 350
Thr Lys Asp Leu Arg Gly Glu Met Thr Pro Pro Ser Val
        355                 360                 365

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SEQ ID NO: 7: 



Met Ala Arg Ala Val Gly Pro Glu Arg Arg Leu Leu Ala Val Tyr Thr
1               5                   10                  15
Gly Gly Thr Ile Gly Met Arg Ser Glu Leu Gly Val Leu Val Pro Gly
            20                  25                  30
Thr Gly Leu Ala Ala Ile Leu Arg Thr Leu Pro Met Phe His Asp Glu
        35                  40                  45
Glu His Ala Arg Ala Arg Gly Leu Ser Glu Asp Thr Leu Val Leu Pro
    50                  55                  60
Pro Asp Ser Arg Asn Gln Arg Ile Leu Tyr Thr Val Leu Glu Cys Gln
65                  70                  75                  80
Pro Leu Phe Asp Ser Ser Asp Met Thr Ile Ala Glu Trp Val Arg Val
                85                  90                  95
Ala Gln Thr Ile Lys Arg His Tyr Glu Gln Tyr His Gly Phe Val Val
            100                 105                 110
Ile His Gly Thr Asp Thr Met Ala Phe Ala Ala Ser Met Leu Ser Phe
        115                 120                 125
Met Leu Glu Asn Leu Gln Lys Thr Val Ile Leu Thr Gly Ala Gln Val
    130                 135                 140
Pro Ile His Ala Leu Trp Ser Asp Gly Arg Glu Asn Leu Leu Gly Ala
145                 150                 155                 160
Leu Leu Met Ala Gly Gln Tyr Val Ile Pro Glu Val Cys Leu Phe Phe
                165                 170                 175
Gln Asn Gln Leu Phe Arg Gly Asn Arg Ala Thr Lys Val Asp Ala Arg
            180                 185                 190
Arg Phe Ala Ala Phe Cys Ser Pro Asn Leu Leu Pro Leu Ala Thr Val
        195                 200                 205
Gly Ala Asp Ile Thr Ile Asn Arg Glu Leu Val Arg Lys Val Asp Gly
    210                 215                 220
Lys Ala Gly Leu Val Val His Ser Ser Met Glu Gln Asp Val Gly Leu
225                 230                 235                 240
Leu Arg Leu Tyr Pro Gly Ile Pro Ala Ala Leu Val Arg Ala Phe Leu
                245                 250                 255
Gln Pro Pro Leu Lys Gly Val Val Met Glu Thr Phe Gly Ser Gly Asn
            260                 265                 270
Gly Pro Thr Lys Pro Asp Leu Leu Gln Glu Leu Arg Val Ala Thr Glu
        275                 280                 285
Arg Gly Leu Val Ile Val Asn Cys Thr Gln Cys Leu Arg Gly Ala Val
    290                 295                 300
Thr Thr Asp Tyr Ala Ala Gly Met Ala Met Ala Gly Ala Gly Val Ile
305                 310                 315                 320
Ser Gly Phe Asp Met Thr Ser Glu Ala Ala Leu Ala Lys Leu Ser Tyr
                325                 330                 335
Val Leu Gly Gln Pro Gly Leu Ser Leu Asp Val Arg Lys Glu Leu Leu
            340                 345                 350
Thr Lys Asp Leu Arg Gly Glu Met Thr Pro Pro Ser Val
        355                 360                 365

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SEQ ID NO: 8: 

Met Ala Arg Ala Val Gly Pro Glu Arg Arg Leu Leu Ala Val Tyr Thr
1               5                   10                  15
Gly Gly Thr Ile Gly Met Arg Ser Glu Leu Gly Val Leu Val Pro Gly
            20                  25                  30
Thr Gly Leu Ala Ala Ile Leu Arg Thr Leu Pro Met Phe His Asp Glu
        35                  40                  45
Glu His Ala Arg Ala Arg Gly Leu Ser Glu Asp Thr Leu Val Leu Pro
    50                  55                  60
Pro Asp Ser Arg Asn Gln Arg Ile Leu Tyr Thr Val Leu Glu Cys Gln
65                  70                  75                  80
Pro Leu Phe Asp Ser Ser Asp Met Thr Ile Ala Glu Trp Val Arg Val
                85                  90                  95
Ala Gln Thr Ile Lys Arg His Tyr Glu Gln Tyr His Gly Phe Val Val
            100                 105                 110
Ile His Gly Thr Asp Thr Met Ala Phe Ala Ala Ser Met Leu Ser Phe
        115                 120                 125
Met Leu Glu Asn Leu Gln Lys Thr Val Ile Leu Thr Gly Ala Gln Val
    130                 135                 140
Pro Ile His Ala Leu Trp Ser Asp Gly Arg Glu Asn Leu Leu Gly Ala
145                 150                 155                 160
Leu Leu Met Ala Gly Gln Tyr Val Ile Pro Glu Val Cys Leu Phe Phe
                165                 170                 175
Gln Asn Gln Leu Phe Arg Gly Asn Arg Ala Thr Lys Val Asp Ala Arg
            180                 185                 190
Arg Phe Ala Ala Phe Cys Ser Pro Asn Leu Leu Pro Leu Ala Thr Val
        195                 200                 205
Gly Ala Asp Ile Thr Ile Asn Arg Glu Leu Val Arg Lys Val Asp Gly
    210                 215                 220
Lys Ala Gly Leu Val Val His Ser Ser Met Glu Gln Asp Val Gly Leu
225                 230                 235                 240
Leu Arg Leu Tyr Pro Gly Ile Pro Ala Ala Leu Val Arg Ala Phe Leu
                245                 250                 255
Gln Pro Pro Leu Lys Gly Val Val Met Glu Thr Phe Gly Ser Gly Asn
            260                 265                 270
Gly Pro Thr Lys Pro Asp Leu Leu Gln Glu Leu Arg Val Ala Thr Glu
        275                 280                 285
Arg Gly Leu Val Ile Val Asn Cys Thr Gln Cys Leu Gln Gly Ala Val
    290                 295                 300
Thr Thr Asp Tyr Ala Ala Gly Met Ala Met Ala Gly Ala Asn Val Ile
305                 310                 315                 320
Ser Gly Phe Asp Met Thr Ser Glu Ala Ala Leu Ala Lys Leu Ser Tyr
                325                 330                 335
Val Leu Gly Gln Pro Gly Leu Ser Leu Asp Val Arg Lys Glu Leu Leu
            340                 345                 350
Thr Lys Asp Leu Arg Gly Glu Met Thr Pro Pro Ser Val
        355                 360                 365

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SEQ ID NO: 9: 



Met Ala Arg Ala Val Gly Pro Glu Arg Arg Leu Leu Ala Val Tyr Thr
1               5                   10                  15
Gly Gly Thr Ile Gly Met Arg Ser Glu Leu Gly Val Leu Val Pro Gly
            20                  25                  30
Thr Gly Leu Ala Ala Ile Leu Arg Thr Leu Pro Met Phe His Asp Glu
        35                  40                  45
Glu His Ala Arg Ala Arg Gly Leu Ser Glu Asp Thr Leu Val Leu Pro
    50                  55                  60
Pro Asp Ser Arg Asn Gln Arg Ile Leu Tyr Thr Val Leu Glu Cys Gln
65                  70                  75                  80
Pro Leu Phe Asp Ser Ser Asp Met Thr Ile Ala Glu Trp Val Arg Val
                85                  90                  95
Ala Gln Thr Ile Lys Arg His Tyr Glu Gln Tyr His Gly Phe Val Val
            100                 105                 110
Ile His Gly Thr Asp Thr Met Ala Phe Ala Ala Ser Met Leu Ser Phe
        115                 120                 125
Met Leu Glu Asn Leu Gln Lys Thr Val Ile Leu Thr Gly Ala Gln Val
    130                 135                 140
Pro Ile His Ala Leu Trp Ser Asp Gly Arg Glu Asn Leu Leu Gly Ala
145                 150                 155                 160
Leu Leu Met Ala Gly Gln Tyr Val Ile Pro Glu Val Cys Leu Phe Phe
                165                 170                 175
Gln Asn Gln Leu Phe Arg Gly Asn Arg Ala Thr Lys Val Asp Ala Arg
            180                 185                 190
Arg Phe Ala Ala Phe Cys Ser Pro Asn Leu Leu Pro Leu Ala Thr Val
        195                 200                 205
Gly Ala Asp Ile Thr Ile Asn Arg Glu Leu Val Arg Lys Val Asp Gly
    210                 215                 220
Lys Ala Gly Leu Val Val His Ser Ser Met Glu Gln Asp Val Gly Leu
225                 230                 235                 240
Leu Arg Leu Tyr Pro Gly Ile Pro Ala Ala Leu Val Arg Ala Phe Leu
                245                 250                 255
Gln Pro Pro Leu Lys Gly Val Val Met Glu Thr Phe Gly Ser Gly Asn
            260                 265                 270
Gly Pro Thr Lys Pro Asp Leu Leu Gln Glu Leu Arg Val Ala Thr Glu
        275                 280                 285
Arg Gly Leu Val Ile Val Asn Cys Thr Gln Cys Leu Gln Gly Ala Val
    290                 295                 300
Thr Thr Asp Tyr Ala Ala Gly Met Ala Met Ala Gly Ala Gly Val Ile
305                 310                 315                 320
Ser Gly Phe Asp Met Thr Ser Glu Ala Ala Leu Ala Lys Leu Ser Tyr
                325                 330                 335
Val Leu Gly Gln Pro Gly Leu Ser Leu Asp Val Arg Lys Glu Leu Leu
            340                 345                 350
Thr Lys Asp Leu Arg Gly Glu Met Thr Pro Pro Ser Val.
        355                 360                 365
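
As read from the listings above, SEQ ID NOs: 7 to 9 appear to differ from SEQ ID NO: 6 only at isolated residues. The following minimal Python sketch shows one way to locate such substitutions. It compares residues 289 to 320 as transcribed from SEQ ID NOs: 6 and 9; the helper name `substitutions` and the variable names are illustrative assumptions and are not part of the application.

```python
def substitutions(ref, var, start=1):
    """Return (position, reference residue, variant residue) for each mismatch."""
    if len(ref) != len(var):
        raise ValueError("sequences must have equal length")
    return [(start + i, a, b) for i, (a, b) in enumerate(zip(ref, var)) if a != b]

# Residues 289-320, transcribed from the listings for SEQ ID NO: 6 and SEQ ID NO: 9.
SEQ6_289_320 = ("Arg Gly Leu Val Ile Val Asn Cys Thr Gln Cys Leu Arg Gly Ala Val "
                "Thr Thr Asp Tyr Ala Ala Gly Met Ala Met Ala Gly Ala Asn Val Ile").split()
SEQ9_289_320 = ("Arg Gly Leu Val Ile Val Asn Cys Thr Gln Cys Leu Gln Gly Ala Val "
                "Thr Thr Asp Tyr Ala Ala Gly Met Ala Met Ala Gly Ala Gly Val Ile").split()

# Report each substitution as "<reference residue><position> -> <variant residue>".
for pos, ref_res, var_res in substitutions(SEQ6_289_320, SEQ9_289_320, start=289):
    print(f"{ref_res}{pos} -> {var_res}")
```

On this reading, SEQ ID NO: 9 carries two substitutions relative to SEQ ID NO: 6, at positions 301 and 318, while SEQ ID NOs: 7 and 8 each carry one of them.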







5. The polypeptide of claim 1, which originates from a member selected from the group consisting of guinea pig and
human.

6. The polypeptide of claim 1, which exists in the form of an oligomer.

7. A DNA which encodes the polypeptide as claimed in claim 1.

8. The DNA of claim 7, which contains a nucleotide sequence selected from the group consisting of SEQ ID NOs: 10
to 14, nucleotide sequences homologous thereto, and nucleotide sequences complementary thereto;



SEQ ID NO: 10:

ATGGCGCGCG CATCAGGCTC CGAGAGGCAC CTGCTGCTCA TCTACACTGG CGGCACTTTG     60
GGCATGCAGA GCAAGGGCGG GGTGCTCGTC CCCGGCCCAG GCCTGGTCAC TCTGCTGCGG    120
ACCCTGCCCA TGTTCCATGA CAAGGAGTTC GCCCAGGCCC AGGGCCTCCC TGACCATGCT    180
CTGGCGCTGC CCCCTGCCAG CCACGGCCCC AGGGTCCTCT ACACGGTGCT GGAGTGCCAG    240
CCCCTCTTGG ATTCCAGCGA CATGACCATC GATGATTGGA TTCGCATAGC CAAGATCATA    300
GAGAGGCACT ATGAGCAGTA CCAAGGCTTT GTGGTTATCC ACGGCACCGA CACCATGGCC    360
TTTGGGGCCT CCATGCTGTC CTTCATGCTG GAAAACCTGC ACAAACCAGT CATCCTCACT    420
GGCGCCCAGG TGCCAATCCG TGTGCTGTGG AATGACGCCC GGGAAAACCT GCTGGGGGCG    480
TTGCTTGTGG CCGGCCAATA CATCATCCCT GAGGTCTGCC TGTTTATGAA CAGTCAGCTG    540
TTTCGGGGAA ACCGGGTAAC CAAGGTGGAC TCCCAGAAGT TTGAGGCCTT CTGCTCCCCC    600
AATCTGTCCC CACTAGCCAC TGTGGGCGCG GATGTCACAA TTGCCTGGGA CCTGGTGCGC    660
AAGGTCAACT GGAAGGACCC GCTGGTGGTG CACAGCAACA TGGAGCACGA CGTGGCACTG    720
CTGCGCCTCT ACCCTGGCAT CCCGGCCTCC CTGGTCCGGG CATTCCTGCA GCCCCCGCTC    780
AAGGGCGTGG TCCTGGAGAC CTTCGGCTCT GGCAACGGGC CGAGCAAGCC CGACCTGCTG    840
CAGGAGTTGC GGGCCGCGGC CCAGCGCGGC CTCATCATGG TCAACTGCAG CCAGTGCCTG    900
CGGGGGTCTG TGACCCCGGG CTATGCCACG AGCTTGGCGG GCGCCAACAT CGTGTCCGGC    960
TTAGACATGA CCTCAGAGGC CGCGCTGGCT AAGCTGTCCT ACGTGTTGGG CCTGCCGGAG   1020
CTGAGCCTGG AGCGCAGGCA GGAGCTGCTG GCCAAGGATC TTCGCGGGGA AATGACACTG   1080
CCCACGGCA                                                           1089

SEQ ID NO: 11:

ATGGCGCGCG CGGTGGGGCC CGAGCGGAGG CTGCTGGCCG TCTACACCGG CGGCACCATT     60
GGCATGCGGA GTGAGCTCGG CGTGCTTGTG CCCGGGACGG GCCTGGCTGC CATCCTGAGG    120
ACACTGCCCA TGTTCCATGA CGAGGAGCAC GCCCGAGCCC GCGGCCTCTC TGAGGACACC    180
CTGGTGCTAC CCCCGGACAG CCGCAACCAG AGGATCCTCT ACACGGTGCT GGAGTGCCAG    240
CCCCTCTTCG ACTCCAGTGA CATGACCATC GCTGAGTGGG TTCGCGTTGC CCAGACCATC    300
AAGAGGCACT ACGAGCAGTA CCACGGCTTT GTGGTCATCC ACGGCACCGA CACCATGGCC    360
TTTGCTGCCT CGATGCTGTC CTTCATGCTG GAGAACCTGC AGAAGACTGT CATCCTCACT    420
GGGGCCCAGG TGCCCATCCA TGCCCTGTGG AGCGACGGCC GTGAGAACCT GCTGGGGGCA    480
CTGCTCATGG CTGGCCAGTA TGTGATCCCA GAGGTCTGCC TTTTCTTCCA GAATCAGCTG    540
TTTCGGGGCA ACCGGGCAAC CAAGGTAGAC GCTCGGAGGT TCGCAGCTTT CTGCTCCCCG    600
AACCTGCTGC CTCTGGCCAC AGTGGGTGCT GACATCACAA TCAACAGGGA GCTGGTGCGG    660
AAGGTGGACG GGAAGGCTGG GCTGGTGGTG CACAGCAGCA TGGAGCAGGA CGTGGGCCTG    720
CTGCGCCTCT ACCCTGGGAT CCCTGCCGCC CTGGTTCGGG CCTTCTTGCA GCCTCCCCTG    780
AAGGGCGTGG TCATGGAGAC CTTCGGTTCA GGGAACGGAC CCACCAAGCC CGACCTGCTG    840
CAGGAGCTGC GGGTGGCCAC CGAGCGCGGC CTGGTCATCG TCAACTGTAC CCAGTGCCTC    900
CGGGGGGCTG TGACCACAGA CTATGCAGCT GGCATGGCCA TGGCGGGAGC CAACGTCATC    960
TCAGGCTTCG ACATGACATC GGAGGCCGCC CTGGCCAAGC TATCGTATGT GCTGGGCCAG   1020
CCAGGGCTGA GCCTGGATGT CAGGAAGGAG CTGCTGACCA AGGACCTTCG GGGGGAGATG   1080
ACGCCACCCT CGGTG                                                    1095

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SEQ ID NO: 12: 

ATGGCGCGCG CGGTGGGGCC CGAGCGGAGG CTGCTGGCCG TCTACACCGG CGGCACCATT     60
GGCATGCGGA GTGAGCTCGG CGTGCTTGTG CCCGGGACGG GCCTGGCTGC CATCCTGAGG    120
ACACTGCCCA TGTTCCATGA CGAGGAGCAC GCCCGAGCCC GCGGCCTCTC TGAGGACACC    180
CTGGTGCTAC CCCCGGACAG CCGCAACCAG AGGATCCTCT ACACCGTGCT GGAGTGCCAG    240
CCCCTCTTCG ACTCCAGTGA CATGACCATC GCTGAGTGGG TTCGCGTTGC CCAGACCATC    300
AAGAGGCACT ACGAGCAGTA CCACGGCTTT GTGGTCATCC ACGGCACCGA CACCATGGCC    360
TTTGCTGCCT CGATGCTGTC CTTCATGCTG GAGAACCTGC AGAAGACTGT CATCCTCACT    420
GGGGCCCAGG TGCCCATCCA TGCCCTGTGG AGCGACGGCC GTGAGAACCT GCTGGGGGCA    480
CTGCTCATGG CTGGCCAGTA TGTGATCCCA GAGGTCTGCC TTTTCTTCCA GAATCAGCTG    540
TTTCGGGGCA ACCGGGCAAC CAAGGTAGAC GCTCGGAGGT TCGCAGCTTT CTGCTCCCCG    600
AACCTGCTGC CTCTGGCCAC AGTGGGTGCT GACATCACAA TCAACAGGGA GCTGGTGCGG    660
AAGGTGGACG GGAAGGCTGG GCTGGTGGTG CACAGCAGCA TGGAGCAGGA CGTGGGCCTG    720
CTGCGCCTCT ACCCTGGGAT CCCTGCCGCC CTGGTTCGGG CCTTCTTGCA GCCTCCCCTG    780
AAGGGCGTGG TCATGGAGAC CTTCGGTTCA GGGAACGGAC CCACCAAGCC CGACCTGCTG    840
CAGGAGCTGC GGGTGGCCAC CGAGCGCGGC CTGGTCATCG TCAACTGTAC CCAGTGCCTC    900
CGGGGGGCTG TGACCACAGA CTATGCAGCT GGCATGGCCA TGGCGGGAGC CGGCGTCATC    960
TCAGGCTTCG ACATGACATC GGAGGCCGCC CTGGCCAAGC TATCGTATGT GCTGGGCCAG   1020
CCAGGGCTGA GCCTGGATGT CAGGAAGGAG CTGCTGACCA AGGACCTTCG GGGGGAGATG   1080
ACGCCACCCT CGGTG                                                    1095



SEQ ID NO: 13:

ATGGCGCGCG CGGTGGGGCC CGAGCGGAGG CTGCTGGCCG TCTACACCGG CGGCACCATT     60
GGCATGCGGA GTGAGCTCGG CGTGCTTGTG CCCGGGACGG GCCTGGCTGC CATCCTGAGG    120
ACACTGCCCA TGTTCCATGA CGAGGAGCAC GCCCGAGCCC GCGGCCTCTC TGAGGACACC    180
CTGGTGCTAC CCCCGGACAG CCGCAACCAG AGGATCCTCT ACACCGTGCT GGAGTGCCAG    240
CCCCTCTTCG ACTCCAGTGA CATGACCATC GCTGAGTGGG TTCGCGTTGC CCAGACCATC    300
AAGAGGCACT ACGAGCAGTA CCACGGCTTT GTGGTCATCC ACGGCACCGA CACCATGGCC    360
TTTGCTGCCT CGATGCTGTC CTTCATGCTG GAGAACCTGC AGAAGACTGT CATCCTCACT    420
GGGGCCCAGG TGCCCATCCA TGCCCTGTGG AGCGACGGCC GTGAGAACCT GCTGGGGGCA    480
CTGCTCATGG CTGGCCAGTA TGTGATCCCA GAGGTCTGCC TTTTCTTCCA GAATCAGCTG    540
TTTCGGGGCA ACCGGGCAAC CAAGGTAGAC GCTCGGAGGT TCGCAGCTTT CTGCTCCCCG    600
AACCTGCTGC CTCTGGCCAC AGTGGGTGCT GACATCACAA TCAACAGGGA GCTGGTGCGG    660
AAGGTGGACG GGAAGGCTGG GCTGGTGGTG CACAGCAGCA TGGAGCAGGA CGTGGGCCTG    720
CTGCGCCTCT ACCCTGGGAT CCCTGCCGCC CTGGTTCGGG CCTTCTTGCA GCCTCCCCTG    780
AAGGGCGTGG TCATGGAGAC CTTCGGTTCA GGGAACGGAC CCACCAAGCC CGACCTGCTG    840
CAGGAGCTGC GGGTGGCCAC CGAGCGCGGC CTGGTCATCG TCAACTGTAC CCAGTGCCTC    900
CAGGGGGCTG TGACCACAGA CTATGCAGCT GGCATGGCCA TGGCGGGAGC CAACGTCATC    960
TCAGGCTTCG ACATGACATC GGAGGCCGCC CTGGCCAAGC TATCGTATGT GCTGGGCCAG   1020
CCAGGGCTGA GCCTGGATGT CAGGAAGGAG CTGCTGACCA AGGACCTTCG GGGGGAGATG   1080
ACGCCACCCT CGGTG                                                    1095

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SEQ ID NO: 14: 

ATGGCGCGCG CGGTGGGGCC CGAGCGGAGG CTGCTGGCCG TCTACACCGG CGGCACCATT     60
GGCATGCGGA GTGAGCTCGG CGTGCTTGTG CCCGGGACGG GCCTGGCTGC CATCCTGAGG    120
ACACTGCCCA TGTTCCATGA CGAGGAGCAC GCCCGAGCCC GCGGCCTCTC TGAGGACACC    180
CTGGTGCTAC CCCCGGACAG CCGCAACCAG AGGATCCTCT ACACCGTGCT GGAGTGCCAG    240
CCCCTCTTCG ACTCCAGTGA CATGACCATC GCTGAGTGGG TTCGCGTTGC CCAGACCATC    300
AAGAGGCACT ACGAGCAGTA CCACGGCTTT GTGGTCATCC ACGGCACCGA CACCATGGCC    360
TTTGCTGCCT CGATGCTGTC CTTCATGCTG GAGAACCTGC AGAAGACTGT CATCCTCACT    420
GGGGCCCAGG TGCCCATCCA TGCCCTGTGG AGCGACGGCC GTGAGAACCT GCTGGGGGCA    480
CTGCTCATGG CTGGCCAGTA TGTGATCCCA GAGGTCTGCC TTTTCTTCCA GAATCAGCTG    540
TTTCGGGGCA ACCGGGCAAC CAAGGTAGAC GCTCGGAGGT TCGCAGCTTT CTGCTCCCCG    600
AACCTGCTGC CTCTGGCCAC AGTGGGTGCT GACATCACAA TCAACAGGGA GCTGGTGCGG    660
AAGGTGGACG GGAAGGCTGG GCTGGTGGTG CACAGCAGCA TGGAGCAGGA CGTGGGCCTG    720
CTGCGCCTCT ACCCTGGGAT CCCTGCCGCC CTGGTTCGGG CCTTCTTGCA GCCTCCCCTG    780
AAGGGCGTGG TCATGGAGAC CTTCGGTTCA GGGAACGGAC CCACCAAGCC CGACCTGCTG    840
CAGGAGCTGC GGGTGGCCAC CGAGCGCGGC CTGGTCATCG TCAACTGTAC CCAGTGCCTC    900
CAGGGGGCTG TGACCACAGA CTATGCAGCT GGCATGGCCA TGGCGGGAGC CGGCGTCATC    960
TCAGGCTTCG ACATGACATC GGAGGCCGCC CTGGCCAAGC TATCGTATGT GCTGGGCCAG   1020
CCAGGGCTGA GCCTGGATGT CAGGAAGGAG CTGCTGACCA AGGACCTTCG GGGGGAGATG   1080
ACGCCACCCT CGGTG                                                    1095.
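
As a consistency check on the listings above, the following minimal Python sketch translates the first 60 nucleotides of SEQ ID NO: 10 with the standard genetic code and compares the result with the first 20 residues of SEQ ID NO: 4. The function and variable names are illustrative assumptions and are not part of the application; only the sequence fragments are taken from the listings.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "End",
}

def translate(dna):
    """Translate a coding-strand DNA string into three-letter residue codes."""
    dna = "".join(dna.split()).upper()   # drop the spacing used in the listing
    usable = len(dna) - len(dna) % 3     # ignore a trailing partial codon
    return [ONE_TO_THREE[CODON_TABLE[dna[i:i + 3]]] for i in range(0, usable, 3)]

# First 60 nucleotides of SEQ ID NO: 10, copied from the listing.
SEQ10_HEAD = "ATGGCGCGCG CATCAGGCTC CGAGAGGCAC CTGCTGCTCA TCTACACTGG CGGCACTTTG"
# First 20 residues of SEQ ID NO: 4, copied from the listing.
SEQ4_HEAD = ("Met Ala Arg Ala Ser Gly Ser Glu Arg His "
             "Leu Leu Leu Ile Tyr Thr Gly Gly Thr Leu").split()

print(translate(SEQ10_HEAD) == SEQ4_HEAD)
```

The same function can be applied to the full listings to compare SEQ ID NOs: 11 to 14 with SEQ ID NOs: 6 to 9.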

9. The DNA of claim 7, which originates from a member selected from the group consisting of guinea pig and human.

10. A self-replicable vector which contains a DNA encoding the polypeptide as claimed in claim 1.

11. The self-replicable vector of claim 10, which is a plasmid vector.

12. The self-replicable vector of claim 10, which contains one or more members selected from the group consisting
of metallothionein and Tac promoters.

13. A transformant obtainable by introducing the DNA of claim 7 into a host.

14. The transformant of claim 13, wherein said host is a member selected from the group consisting of prokaryotic and
eukaryotic cells.

15. The transformant of claim 13, wherein said host is Escherichia coli.

16. The transformant of claim 13, wherein said host is a mouse cell.

17. A transformant obtainable by introducing the self-replicable vector of claim 10 into a host.

18. The transformant of claim 17, wherein said host is a member selected from the group consisting of prokaryotic
and eukaryotic cells.

19. The transformant of claim 17, wherein said host is Escherichia coli.

20. The transformant of claim 17, wherein said host is a mouse cell.

21. A process for preparing a polypeptide, which comprises (a) artificially expressing a DNA encoding the polypeptide
of claim 1, and (b) collecting the polypeptide from the resultant mixture.

22. The process of claim 21, wherein the artificial expression of step (a) comprises culturing the transformant of
claim 13.

23. The process of claim 21, wherein the artificial expression of step (a) comprises culturing the transformant of
claim 17.

24. The process of claim 21, wherein the resultant mixture of step (b) is a culture of the transformant of claim 13.

25. The process of claim 21, wherein the resultant mixture of step (b) is a culture of the transformant of claim 17.

26. The process of claim 21, wherein the polypeptide is collected by one or more techniques selected from the group
consisting of salting out, dialysis, filtration, concentration, gel filtration chromatography, ion-exchange
chromatography, affinity chromatography, hydrophobic chromatography, isoelectric focusing and gel electrophoresis.

27. An agent for susceptive diseases, which contains the polypeptide of claim 1 as an effective ingredient.

28. The agent of claim 27, wherein said diseases are malignant tumors, leukemias and lymphomas.

29. The agent of claim 27, which contains one or more members selected from the group consisting of serum albumin,
glycerol, gelatin, trehalose and maltose as a stabilizer.

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Note: An asterisk indicates a site where a nucleotide is substituted, 
and a box indicates a polypeptide-encoding sequence. 



FIG. 1



EP0 811 687 A2 



FIG. 2 (plasmid map; legible labels: Eco RI, Ptac)



Template DNA: pCGPA/WT

Sense primer: 5'-GTGAATTCGGAGGTTCAGATGGCGCGCGCATCA-3'

Anti-sense primer: 5'-CTGCGGCCGCTCAGATGGCAGGCGGCAC-3'

↓ PCR

Amplified DNA

↓ Cleavage by Eco RI and Not I

DNA fragment about 1.7 kbp in length

Linkers:

5'-TCGAGCCACCATGAAGTGTTCGTGGGTTATT-3'
5'-TTCTTCCTGATGGCCGTAGTGACAGGAGTG-3'
5'-AATTCACTCCTGTCACTACGGCCATCAGGA-3'
5'-AGAAAATAACCCACGAACACTTCATGGTGGC-3'

↓ Phosphorylation by T4 polynucleotide kinase

5'-terminal phosphorylated linkers

(plasmid map; legible labels: Eco RI, BPVI)

↓ Cleavage by Xho I and Not I

↓ Ligation

FIG. 3
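
The primer design shown in FIG. 3 can be checked with a short script. The sketch below, in Python, locates the Eco RI (GAATTC) and Not I (GCGGCCGC) recognition sequences in the two primers and reverse-complements the remainder of the anti-sense primer so that it reads on the coding strand. The helper and variable names are illustrative assumptions; only the primer sequences come from the figure.

```python
# Primer sequences as given in FIG. 3 (5' to 3').
SENSE = "GTGAATTCGGAGGTTCAGATGGCGCGCGCATCA"
ANTISENSE = "CTGCGGCCGCTCAGATGGCAGGCGGCAC"

ECO_RI = "GAATTC"      # Eco RI recognition sequence
NOT_I = "GCGGCCGC"     # Not I recognition sequence

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]

# Zero-based offsets of the restriction sites within each primer.
eco_at = SENSE.find(ECO_RI)
not_at = ANTISENSE.find(NOT_I)

# Part of the anti-sense primer after the Not I site, read on the coding strand.
coding_tail = reverse_complement(ANTISENSE[not_at + len(NOT_I):])

print(eco_at, not_at, coding_tail)
```

Read in frame, the coding-strand tail consists of the codons GTG CCG CCT GCC ATC followed by TGA, that is, Val Pro Pro Ala Ile and a stop codon, which agrees with the C-terminal residues listed for SEQ ID NO: 5.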



FIG. 4 (plasmid map; legible labels: Eco RI, Xho I)

FIG. 5 (plasmid map; legible labels: Eco RI, pBR322 ori)

FIG. 6 (plasmid map; legible labels: Eco RI, pBR322 ori)

FIG. 7 (plasmid map; legible labels: Eco RI, pBR322 ori)

FIG. 8 (plasmid map; legible labels: Eco RI, pBR322 ori)

FIG. 9 (plasmid map; legible labels: Eco RI, pBR322 ori)

FIG. 10 (plasmid map; legible labels: Eco RI, Ig sec, Not I, pBR322 ori, poly (A), BPVI; one further label illegible)

FIG. 11 (plasmid map of pBIgHA/MUT1; legible labels: Eco RI, Xho I, AmpR, PmtI, pBR322 ori, poly (A), BPVI)

FIG. 12 (plasmid map; legible labels: Eco RI, Xho I, HA/MUT2, Not I, pBR322 ori, poly (A), BPVI)

FIG. 13 (plasmid map; legible label: BPVI)

FIG. 14 (plasmid map; legible labels: Eco RI, Xho I, Not I, pBR322 ori, poly (A), BPVI)



(19) Europäisches Patentamt
     European Patent Office
     Office européen des brevets

(11) EP 0 811 687 A3

(12) EUROPEAN PATENT APPLICATION

(88) Date of publication A3: 28.10.1998 Bulletin 1998/44

(43) Date of publication A2: 10.12.1997 Bulletin 1997/50

(21) Application number: 97303896.1

(22) Date of filing: 06.06.1997

(51) Int. Cl.6: C12N 15/55, C12N 9/82, C12N 15/70, C12N 15/79, C12N 1/21, C12N 5/10, A61K 38/46
     //(C12N1/21, C12R1:19)

(84) Designated Contracting States:
     AT BE CH DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE
     Designated Extension States:
     AL LT LV RO SI

(30) Priority: 07.06.1996 JP 168172/96

(71) Applicant: KABUSHIKI KAISHA HAYASHIBARA SEIBUTSU KAGAKU KENKYUJO
     Okayama-shi Okayama (JP)

(72) Inventors:
     • Ario, Takeshi
       Okayama-shi, Okayama (JP)
     • Taniai, Madoka
       Okayama-shi, Okayama (JP)
     • Yamamoto, Kozo
       Okayama-shi, Okayama (JP)
     • Kurimoto, Masashi
       Okayama-shi, Okayama (JP)

(74) Representative: Daniels, Jeffrey Nicholas et al
     Page White & Farrer
     54 Doughty Street
     London WC1N 2LS (GB)





(54) Polypeptides having L-asparaginase activity

(57) Disclosed are polypeptides of mammalian origin having L-asparaginase activity. The
polypeptides are easily prepared by applying recombinant DNA techniques to DNAs encoding the
polypeptides. They exert satisfactory effects in the treatment and/or prevention of diseases
caused by tumor cells dependent on L-asparagine, and cause no substantial serious side effects
even when administered to humans at relatively high doses.



DNA with an original nucleotide sequence

Primers: 5'-terminal primer, mutagenic primer A, mutagenic primer S, 3'-terminal primer

↓ [First step PCRs]

Primers: 5'-terminal primer, 3'-terminal primer

↓ [Second step PCR]

Note: An asterisk indicates a site where a nucleotide is substituted,
and a box indicates a polypeptide-encoding sequence.

FIG. 1

Printed by Jouve, 75001 PARIS (FR)




EP 0 811 687 A3

European Patent Office

EUROPEAN SEARCH REPORT

Application Number: EP 97 30 3896

DOCUMENTS CONSIDERED TO BE RELEVANT

Columns: Category; Citation of document with indication, where appropriate, of relevant passages; Relevant to claim; Classification of the application (Int.Cl.6)

Category X,P
EP 0 726 313 A (HAYASHIBARA BIOCHEM LAB), 14 August 1996
* the whole document *
Relevant to claim: 1-11, 13-15, 17-19, 21-25, 27, 28

YELLIN T O ET AL: "PURIFICATION AND PROPERTIES OF GUINEA PIG SERUM ASPARAGINASE"
BIOCHEMISTRY, vol. 5, no. 5, May 1966, pages 1605-1612, XP002012974
* the whole document *
Relevant to claim: 1-29

PATENT ABSTRACTS OF JAPAN
vol. 004, no. 046 (C-006), 10 April 1980
& JP 55 019018 A (TOUBISHI YAKUHIN KOGYO KK), 9 February 1980
* abstract *
Relevant to claim: 1,2,4-29

PATENT ABSTRACTS OF JAPAN
vol. 017, no. 158 (C-1041), 29 March 1993
& JP 04 320684 A (JAPAN SYNTHETIC RUBBER CO LTD), 11 November 1992
* abstract *
Relevant to claim: 1,2,4-29

Classification of the application (Int.Cl.6): C12N15/55, C12N9/82, C12N15/70, C12N15/79, C12N1/21, C12N5/10, A61K38/46, //(C12N1/21, C12R1:19)

Technical fields searched (Int.Cl.6): C12N

The present search report has been drawn up for all claims.

Place of search: THE HAGUE
Date of completion of the search: 7 September 1998
Examiner: Fernandez y Branas, F

CATEGORY OF CITED DOCUMENTS

X : particularly relevant if taken alone
Y : particularly relevant if combined with another document of the same category
A : technological background
O : non-written disclosure
P : intermediate document
T : theory or principle underlying the invention
E : earlier patent document, but published on, or after the filing date
D : document cited in the application
L : document cited for other reasons
& : member of the same patent family, corresponding document



